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EXPRESS MAIL NO. EL487465451US 



BACTERIAL FIMBRIAL SYSTEM FOR PRESENTATION OF HETEROLOGOUS 

PEPTIDE SEQUENCES 

5 CROSS-REFERENCE TO RELATED APPLICATION 

This application claims priority to U.S. Provisional Application No. 
60/127,888, filed April 5 ? 1999, which application is incorporated by reference in its 
entirety. 

TECHNICAL FIELD OF THE INVENTION 
10 The invention relates to the use of thin aggregative fimbriae (TAF; 

SEF17) from Salmonella or its homologue, curli from Escherichia coli, to express one 
or more peptide sequences derived from heterologous, pathogenic organisms (bacteria, 
viruses, protozoa). 

BACKGROUND OF THE INVENTION 

15 Creation of Recombinant Salmonella Strains 

Several methods for generating Salmonella vaccine strains expressing 
chromosomally integrated foreign DNA have been developed. Three such methods are: 
1) specific integration into the aroC gene [Strugnell, 1990], 2) using of a defective 
transposable element [Flynn, 1990], and 3) specific integration into the his locus [Hone, 

20 1988]. 

However, use of any of these methods results in chromosomal insertion 
of the recombinant genes into regions where the wild-type (original) genes are not 
normally found. Because of this, the native promoter region(s) cannot be utilized for 
expression. Therefore, the development of strategies for heterologous antigen 
25 presentation which address the use of native promoter regions and minimize genetic 
alterations in the host chromosome are required. 
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SUMMARY OF THE INVENTION 

A high frequency chromosomal gene replacement method of general 
utility was developed for Salmonella enteritidis. This system uses an unstable, 
imperfectly segregating, temperature-sensitive replicon, pHSG415, as a carrier of the 
5 recombinant gene of interest and allows for site-specific replacement of chromosomal 
genes without the need for antibiotic resistance markers in the recombinant genes or the 
use of specific bacterial strains. This strategy was used to replace the chromosomal 
agfA fimbrin genes of & enteritidis 3b with recombinant genes containing a 48 bp DNA 
fragment encoding PT3, an immunoprotective T cell epitope from GP63 of Leishmania 

10 major. This represents the first report of fimbrial epitope replacement in the 
Salmonellae and the first chimeric fimbrin genes that have been reconstituted into a 
wild-type genetic background for any organism. 

Thin aggregative fimbriae (TAF) of S. enteritidis were modified to 
effectively present heterologous epitopes. Sixteen amino acid segments were replaced 

15 by PT3 at 1 0 different sites throughout AgfA, the major fimbrial subunit protein, chosen 
on the basis of primary protein sequence alignment, secondary structure predictions and 
comparison to a 3-D model of AgfA structure. All 10 segment replacements resulted in 
chimeric fimbrin proteins expressed by S. enteritidis from the native chromosomal 
promoter, each representing a replacement of over 10% of the total AgfA protein. 

20 Immunogold electron microscopy indicated that 8 of the 10 chimeric fimbrins were 
effectively assembled into fimbrial fibers expressed at the cell surface of S. enteritidis. 
These results demonstrate that AgfA and thin aggregative fimbriae represents a unique, 
flexible and fascile system for carrying heterologous peptide sequences. 

CsgA and curli, the E. coli homologues to AgfA and thin aggregative 

25 fimbriae, respectively, as well as the proposed minor subunit proteins, AgfB for TAF 
and CsgB for curli, which are homologues of AgfA and CsgA, are likely to be equally 
valid carriers for heterologous sequences. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Schematic of the two step overlap extension or crossover PCR 
30 protocol used to generate recombinant S. enteritidis agfA. (A) Step one of the protocol 



3 



involved a vector-bome (wavy lines) target S. enteritidis gene (grey box) that was PCR 
amplified in two fragments using two pairs of primers, (A/B, C/D, arrows). Internal 
primers encoded the foreign epitope (solid black line) whereas external primers encoded 
restriction endonuclease recognition sites (EcoRI or Hindlll) for subsequent cloning. 
5 (B) In step two, both purified PCR products were combined with external primers as 
described in Experimental procedures and PCR was used to generate gene fragments (1 
and 4) which annealed to generate the whole chimeric gene. (C) Recombinant S. 
enteritidis agfA containing 48 bp foreign Leishmania major DNA sequence encoding 
the 16 amino acid PT3 epitope [Jardim, 1990]; the protein sequence is indicated in bold 
10 type. 

Figure 2. Generation of S. enteritidis strains carrying recombinant agfA. 
A. Map of pHSG415 indicating positions of unique restriction endonuclease sites, 
antibiotic resistance genes (black arrows) and temperature-sensitive origin of replication 
(ori). Figure adapted from [Hashimoto-Gotoh, 1981]. B. Gene replacement strategy. 

15 Transformed S. enteritidis was grown at 42°C with selection pressure to induce a single 
crossover event and subsequent integration of pHSSPIO into the chromosome. Plasmid 
cointegrates were grown at 28°C without selection pressure to induce a second 
crossover event and loss of the pHSG415 sequence, resulting in one of two possibilities: 
1) wild-type agfA or 2) recombinant agfA. 

20 Figure 3. PT3 epitope replacement in AgfA. A. Peptide sequence of the 

PT3 epitope from GP63 of Leishmania major [Jardim, 1990]. Predicted secondary 
structure (Garnier-Robson algorithm, DNAStar software) is listed below the peptide 
sequence: (-) P-strand ; (x) P-helix. B. Schematic diagram of the mature AgfA protein 
illustrating the regions replaced by the PT3 epitope sequence; regions Al to A10 are 

25 indicated with the colored boxes above the sequence. The five-fold internal sequence 
homology of AgfA is represented by regions C5a-e with the consensus sequence 
Sx 5 QxGx 2 NxAx 3 Q, with the 22-residue N-terminal region listed at the bottom. 
Predicted secondary structure (Garnier-Robson algorithm, DNAStar software) is listed 
below the protein sequence: (-) p-strand ; (x) P-helix ; (*) random coil; (<) turn. 

30 Figure 3C. The predicted parallel p-helix model of AgfA (residues 23- 

130) viewed from the front side [Collinson, 1999]. The five-fold repeat segments (C5a- 
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e) and colored residues, representing regions in AgfA replaced with PT3, refer back to 
the diagram in (B). Color Code: Blue = regions A4, A5, A6, A7; Yellow = regions 
A8, A9, A10; Red = regions Al, A2; Green = blue and yellow overlap; Orange = blue, 
yellow and red overlap; Purple = blue and red overlap; Grey = unreplaced sequence. 
5 Figure 4. Western blot analysis of the 10 different S. enteritidis strains 

containing recombinant agfA in the chromosome. Formic acid treated SDS-PAGE 
sample buffer-glycine insoluble material from scraped whole cells of S. enteritidis 3b 2- 
2a (lane 2), Al (lane 3), A2 (lane 4), A3 (lane 5), A4 (lane 6), A5 (lane 7), W+ 3b (lane 
8), A6 (lane 10), A7 (lane 1 1), A8 (lane 12), A9 (lane 13), A10 (lane 14) and A+ control 

10 strain (lane 15) grown on T plates. Purified SEF17 is represented in lanes 1 and 9. The 
molecular mass markers are shown on the left of each blot. A. Blot reacted with 
immune sera raised to SEF 17. B. Blot reacted with immune sera raised to PT3. 

Figure 5. Immunogold electron microscopy of S. enteritidis strains 
containing recombinant agfA in the chromosome. 5. enteritidis strains analyzed after 

15 growth on T plates (A, C, D) or in T broth (B, E, F) for 24h at 37°C, static. 
Immunogold labeling was performed with antiserum raised to thin aggregative fimbriae 
followed by protein A- 15 nm gold (A, B, C, E) or with antiserum raised to the PT3 
epitope followed by goat-anti-rabbit-5 nm gold (D, F). A. S. enteritidis A+ control 
strain grown on T plates, reacted with immune serum raised to thin aggregative 

20 fimbriae. B. 5. enteritidis A+ control strain grown in T broth, reacted with immune 
serum raised to thin aggregative fimbriae. C. S. enteritidis strain A7 grown on T 
plates, reacted with immune serum raised to thin aggregative fimbriae. D. S. enteritidis 
strain A7 grown on T plates, reacted with immune serum raised to PT3. E. S. 
enteritidis strain A4 grown in T broth, reacted with immune serum raised to thin 

25 aggregative fimbriae. F. S. enteritidis strain A4 grown in T broth, reacted with immune 
serum raised to PT3. 

Figure 6. Resistance of the recombinant TAF fibers expressed in S. 
enteritidis to proteinase K treatment. Samples prepared as in Figure 2 but treated with 
proteinase K (0.5 mg/ml) before loading on SDS-PAGE. S. enteritidis 3b 2-2a (lane 2), 

30 Al (lane 3), A2 (lane 4), A3 (lane 5), A4 (lane 6), A5 (lane 7), W+ 3b (lane 8), A6 (lane 
10), A7 (lane 1 1), A8 (lane 12), A9 (lane 13), A10 (lane 14) and A+ control strain (lane 
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15) are represented. Purified SEF17 is represented in lanes 1 and 9. The blots were 
reacted with immune serum generated against SEF17. The molecular mass markers are 
shown on the left of each blot. A. Samples digested with proteinase K for 1 hour. B. 
Samples digested with proteinase K for 2 hours. 
5 Figure 7. The effect of formic acid concentration on the absorbance at 

600 nm of 0.5 mg/ml suspensions thin aggregative fimbriae. Inset: Immunoblot of 20 
|ug of thin aggregative fimbriae treated in one of several formic acid solutions noted 
above each lane and detected on immunoblots with fimbrin-specific immune serum as 
described in the Materials and Methods. The size of each protein is noted in kDa to the 

10 right of the blot. 

Figure 8. Immunoblot analysis of S. enteritidis native thin aggregative 
fimbriae (lanes 1 and 2) or purified fimbriae (lanes 3 and 4). Samples were untreated 
(lanes 1 and 3) or treated (lanes 2 and 4) with proteinase K before immunoblot analysis 
as described in the Materials and Methods. The protein band sizes on immunoblots are 

1 5 noted in kDa to the right of the blot. 

Figure 9. AgfA fimbrin domains and internal amino acid sequence 
homology, (a) Schematic diagram of the N-terminal (N) and C-terminal (C) domains 
of AgfA illustrating the relative positions of the two- or five-fold homologous regions 
C2a-b or C5a-e, respectively, with in the C-terminus. Values in parenthesis denote the 

20 number of amino acids present in each segment, (b) Alignment AgfA fragments C2a 
and C2b. Amino acid identity (•) and conservative replacements (underscored) are 
indicated with gaps (-) introduced for optimal alignments, (c) Alignment of AgfA 
fragments C5a to C5e with gaps (-) introduced for optimal alignments. Conserved 
residues with in each repeat are boxed, (d) The 1 8 amino acid consensus sequence of 

25 the five internal repeats where x is any amino acid. The position of each residue 
corresponds to the numbered residues in Figure 11c. (e) Position of the conserved ten 
nonpolar-polar-nonpolar (ifi) triplet motifs within each of the five 22 or 23 residue 
repeats. Non-conserved residues of each triplet are boxed. 

Figure 10. Secondary structure predictions of AgfA in which extended 

30 (e), helix (h), coil (c) or turn (t) propensity are noted under each amino acid letter 
designation. The programs Alexsis (alx) ? Hierarchial Neural Network (hie), Gamier 



6 



(gar), Gibrat (gib) and NNPredict (nnp) were used to analyze AgfA as described in the 
Materials and Methods. 

Figure 1 1 . The proposed parallel p-helix model of AgfA for residues 23 
to 130 inclusive. Molescript representation of the five coil helix model viewed form the 
5 (a) front side or (b) top. The C-terminal portion of each coil is labeled to identify the 
respective 22 or 23 amino acid repeat segment as identified in Figure 9a,c. (c) A wire 
diagram of the AgfA parallel p helix model viewed as in b but accentuating the vertical 
alignment of the five consensus segments noted in Figure 9c. The amino acids of each 
consensus sequence are numbered as indicated in Figure 9d. Carbon atoms (green) are 

10 distinguished from nitrogen (blue), oxygen (red) and sulfur (yellow). 

Figure 12. The proposed parallel p helix model of AgfA veiwed as a 
wire diagram with space filling model representation of (a) internal or (b) surface 
exposed amino acid side groups viewed from the top as in Figure lib & c. Space 
filling model representation of the AgfA P helix model viewed from the (c) front side as 

15 in Figure 1 la or (d) back side. Amino acid residues are coloured as; nonpolar (green), 
polar (red), basic (blue) or W (yellow). In Figures 12c and d the A, G and internalized 
residues are grey or white where as other surface exposed residues are coloured. 

Figure 13. (a) Molescript representation of the p barrel model of AgfA. 
The N-terminus (N) and C-terminal repeat segments (C5a to e) are numbered as in 

20 Figure 9a,c. (b) Molescript representation of PMP protein using coordinates obtained 
from SCOP on which the AgfA P barrel model was modeled as described in the 
Materials and Methods. 

Figure 14. (a) Molescript representation of the P prism model of AgfA. 
(b) Molescript representation of VMO-I protein using coordinates obtained from SCOP 

25 on which the AgfA P prism model was modeled as described in the Materials and 
Methods. 

Figure 15. Alignment of the five S. enteritidis AgfA C-terminal tandem 
repeat sequences with those of E. coli CsgA such that the amino acid residue positions 
are numbered according to Figure 9d and Figure 11c. Symbols above the numbered 
30 amino acids indicate the positions of proposed internalized (•), surface exposed (o), turn 
(t) residues or the two nonpolar-polar-nonpolar triplets (ifi) within the parallel P helix 
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model of AgfA. Surface or turn residues are colour coded: polar or acidic (red), basic 
(blue), nonpolar (green), W (yellow) and G (black). Proposed internalized residues are 
noted in black. 

Figure 16. Alignments of AgfA with p structural motifs of the three 
5 template proteins of known structure used to assemble AgfA models. AgfA sequence 
was aligned with the (a) |3 roll motif sequence of Serratia marcescens protease (SMP) 
(Baumann, 1994; Braunagel & Benedik, 1990) (b) (3 barrel motif of bovine myelin P2 
protein (PMP) (Jones et ah, 1988), (c) [3 prism motif of the vitelline membrane outer 
layer protein I (VMO-I) (Shimizu & Morikawa, 1996). Bold characters indicate 
10 nonpolar-polar-nonpolar motifs, gaps (-) were introduced for optimal alignments. 
Residues are numbered according to published sequences. 

Figure 17. Schematic of the two step overlap extension or crossover 
yn PCR protocol used to generate chimeric 5. enteritidis genes. (A) Step one of the 

^ protocol involved a plasmid-borne (wavy lines) target S. enteritidis gene (grey box) that 

W 15 was PCR amplified in two fragments using two pairs of primers, (A/B, C/D, arrows). 

□ Internal primers encoded the foreign epitope (solid black line) whereas external primers 

encoded restriction endonuclease recognition sites (EcoRl or Hindlll) for subsequent 
^ cloning. (B) In step two, both purified PCR products were combined with external 

p primers as described in Experimental procedures and PCR was used to generate gene 

%i 20 fragments (1 and 4) which annealed to generate the whole chimeric gene. (C) Chimeric 

^ S. enteritidis sefA and agfA fimbrin genes containing 48 bp foreign Leishmania major 

DNA sequence encoding the 16 amino acid PT3 epitope [25]; the protein sequence is 
indicated in bold type. 

Figure 18. Generation of S. enteritidis strains carrying chimeric fimbrin 
25 genes. A. Map of pHSG415 indicating positions of unique restriction endonuclease 
sites, antibiotic resistance genes (black arrows) and temperature-sensitive origin of 
replication (ori). Figure adapted from [20]. B. Gene replacement strategy (shown for 
sefA::PT3). Transformed S. enteritidis was grown at 42°C with selection pressure to 
induce a single crossover event and subsequent integration of pHSSPIO into the 
30 chromosome. Plasmid cointegrates were grown at 28°C without selection pressure to 
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induce a second crossover event and loss of the pHSG415 sequence, resulting in one of 
two possibilities: 1) wild-type fimbrin gene or 2) chimeric fimbrin gene. 

Figure 19. Western blot analysis to detect expression of chimeric 
sefA::PT3 and agfA::PT3 in E. coli. A. Whole cell ly sates of E. coli XL-1 Blue 
5 harboring pGPl-2 with pTZ18R (lane 1), pTZSef (lane 2), and pTZSPIO (lane 3) 
reacted with anti-SEF14 and -SEF21 immune serum. Purified SEF14 is represented in 
lane 4. B. Whole cell lysates of E. coli XL-1 Blue harboring pGPl-2 and pTZ18R 
(lanes 2 and 5), pTZAgf (lanes 3 and 6), or pTZAP7 (lanes 4 and 7) with (+FA) or 
without (-FA) formic acid treatment reacted with immune sera raised to SEF17. Lane 1 

10 contains FA treated, purified, thin aggregative fimbriae. The sizes (kDa) of the 
molecular mass markers are indicated on the left. 

Figure 20. Western blot analysis of Ap s S. enteritidis strains after 
sefA::PT3 gene replacement. Whole cell lysates of strains #11 (lane 2), #13 (lane 3), 
#192 (lane 4), #197 (lane 5), and W + 3b (lane 6) reacted with anti-SEF14 and -SEF21 

15 immune serum. Purified SEF14 is represented in lane 1. The sizes (kDa) of the 
molecular mass markers are indicated on the left. 

Figure 21. Western blot analysis of Ap s S. enter itidis strains after 
agfA::PT3 gene replacement. Whole cell SDS-PAGE sample buffer-glycine extracts of 
S. enteritidis strains #91 (lane 2), #102 (lane 3), #103 (lane 4), #104 (lane 5), W + 3b 

20 (lane 6) and strain #27 (lane 7) without formic acid treatment (A; -FA), and SDS-PAGE 
sample buffer-glycine insoluble material with formic acid treatment (B; +FA) reacted 
with immune sera raised to SEF17. Purified SEF17 is represented in lane 1. The sizes 
(kDa) of the molecular mass markers are indicated on the left. 

Figure 22. Western blot analysis of T-broth culture supernatant proteins 

25 fromS. enteritidis strains containing agfA::PT3. SDS-PAGE sample buffer-glycine 
extracts of acetone-precipitated culture supernatant proteins from S. enteritidis strains 
#102 (lane 2 and 5) and #103 (lane 3 and 6). Purified SEF17 is represented in lanes 1 
and 4. Lanes 1-3 were reacted with immune sera raised to whole SEF17, whereas Lanes 
4-6 were reacted with immune sera raised to PT3 peptide. The sizes (kDa) of the 

30 molecular mass markers are indicated on the left. 
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Figure 23. PT3 epitope replacement in SefA. A. Peptide sequence of 
the PT3 epitope from GP63 of Leishmania major (Jardim et al., 1990). B. Schematic 
diagram of the mature SefA protein illustrating the regions replaced by the PT3 epitope 
sequence; regions SI to S10 are indicated with the colored boxes above the sequence. 
5 Color Code: Black = region SI; Orange = region S10; Blue = regions S2, S3, S4, S5; 
Green = regions S6, S7, S8, S9. 

Figure 24. Footpad inflammation of vaccinated BALB/c mice after 
Leishmania major challenge. Mice were vaccinated with a single oral dose (A, C) of S. 
enteritidis 3b or A4 (10 5 organisms/mouse) or PBS alone or two oral doses (B, D) of S, 

10 enteritidis 3b or A4 (10 5 and 10 3 organisms/mouse) or PBS alone spaced 2 weeks apart. 
They were challenged with L. major in the footpad 6 weeks (A, B; 10 7 
organisms/mouse) or 8 weeks (C, D; 5 x 10 6 organisms/mouse) after first vaccination. 
Footpad inflammation was measured weekly as the increase in footpad thickness 
(subtracting the thickness of unifected footpad from that of the infected footpad). 

15 Symbols on graphs: (circle) S. enteritidis A4 vaccinated mice; (triangle) S. enteritidis 
3b vaccinated mice; and (square) PBS -vaccinated control mice. Data are means +/- 
standard errors of the mean (n = 5). 

Figure 25. Quantification of Leishmania major parasites in popliteal 
ganglions from infected BALB/c mice. L. major carrying a reporter gene was 

20 quantified by measuring reporter gene activity after performing a limiting dilution assay 
(Titus et aL, 1985). Columns represent total reporter gene units (A) and reporter gene 
units per mg of organ (B) as the mean values from each group: 3-1 , one dose of PBS 
only; 3-2 and 3-3, one vaccination with 10 5 S. enteritidis 3b or A4, respectively; 3-4, 
two doses of PBS only; 3-5 and 3-6, two vaccinations with 10 5 and 10 3 S. enteritidis 3b 

25 or A4, respectively. All groups of mice were challenged in the footpad with 5 x 10 6 live 
L. major eight weeks after first vaccination. Standard errors of the mean for each group 
are listed below the appropriate columns at the bottom of each graph. 

Figure 26. Measurement of serum antibodies against PT3 peptide, 
SEF17 and AgfA. BALB/c mouse serum was tested in triplicate (1 in 100 dilution) by 

30 ELISA against 1 ng/well of PT3 peptide (A), polymerized SEF17 (B), or 
depolymerized SEF17 pre-treated with 90% formic acid (C; Collinson et aL, 1991). 
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Results shown are the mean absorbance values from each group of mice: 1-1, 2-1, 3-1, 
one dose of PBS only; 1-2, 2-2, 3-2, one vaccination with 10 5 S. enteritidis 3b; 1-3, 2-3, 
3-3, one vaccination with 10 5 S. enteritidis A4; 1-4, 2-4, 3-4, two doses of PBS only; 1- 
5, 2-5, 3-5, two vaccinations with 10 5 and 10 3 S. enteritidis 3b; 1-6, 2-6, 3-6, two 
5 vaccinations with 10 5 and 10 3 S. enteritidis A4. Groups 1-1 to 1-6 and 2-1 to 2-6 were 
challenged in the footpad with 10 7 live L. major four and six weeks after first 
vaccination, repectively, and groups 3-1 to 3-6 were challenged in the footpad with 5 x 
10 6 live L. major eight weeks after first vaccination. 



10 Sequence Listing 

The nucleic and amino acid sequences listed in the accompanying 

sequence listing are shown using standard letter abbreviations for nucleotide bases, and 

three letter code for amino acids. Only one strand of each nucleic acid sequence is 

shown, but the complementary strand is understood as included by any reference to the 
15 displayed strand. 

SEQ ID: 1 shows the agfA DNA sequence 

SEQ ID: 2 shows the agfB DNA sequence 

SEQ ID: 3 shows the csgA DNA sequence 

SEQ ID: 4 shows the csgB DNA sequence 
20 SEQ ID: 5 shows the AgfA amino acid sequence 

SEQ ID: 6 shows the AgfB amino acid sequence 

SEQ ID: 7 shows the CsgA amino acid sequence 

SEQ ID: 8 shows the CsgB amino acid sequence 

SEQ ID: 9 shows the DNA sequence encoding the Leishmania major PT3 epitope 
25 SEQ ID: 10 shows the Leishmania major PT3 epitope amino acid sequence 

SEQ ID: 1 1 shows the agfA::PT3#\ DNA sequence 

SEQ ID: 12 shows the AgfA::PT3#l amino acid sequence 

SEQ ID: 13 shows the agfA::PT3#2 DNA sequence 

SEQ ID: 14 shows the AgfA::PT3#2 amino acid sequence 
30 SEQ ID: 15 shows the agfA::PT3#3 DNA sequence 
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SEQ ID: 16 shows the AgfA::PT3#3 amino acid sequence 
SEQ ID: 17 shows the agfA::PT3#4 DNA sequence 
SEQ ID: 18 shows the AgfA::PT3#4 amino acid sequence 
SEQ ID: 19 shows the agfA::PT3#5 DNA sequence 
5 SEQ ID: 20 shows the AgfA::PT3#5 amino acid sequence 
SEQ ID: 21 shows the agfA::PT3#6 DNA sequence 
SEQ ID: 22 shows the AgfA::PT3#6 amino acid sequence 
SEQ ID: 23 shows the agfA::PT3#l DNA sequence 
SEQ ID: 24 shows the AgfA::PT3#7 amino acid sequence 

10 SEQ ID: 25 shows the agfA::PT3#S DNA sequence 

SEQ ID: 26 shows the AgfA::PT3#8 amino acid sequence 
SEQ ID: 27 shows the agfA::PT3#9 DNA sequence 
SEQ ID: 28 shows the AgfA::PT3#9 amino acid sequence 
SEQ ID: 29 shows the agfA::PT3#\0 DNA sequence 

15 SEQ ID: 30 shows the AgfA::PT3#10 amino acid sequence 



DETAILED DESCRIPTION OF THE INVENTION 



Definitions 

Fimbriae (pili): Fimbriae are proteinaceous, hairlike appendages 
produced on the surface of a wide range of bacteria. Fimbriae are generally shorter 

20 (several hundred or thousand nm long) and more numerous than flagella (between 100 
and 1000 fimbriae per cell) and are usually not involved in cellular motility. Detailed 
studies of several fimbriae from a variety of animal and human pathogens have revealed 
that in some case these appendages facilitate bacterial adherence to host tissues 
[Hultgren, 1996]. Fimbriae are composed of a repeated major subunit protein (fimbrin) 

25 and sometimes several minor subunit-like proteins. Fimbrins are polymerized by 
hydrophobic and hydrophilic interactions to form either thick (7 to 8 nm), rigid 
structures, thin (2 to 4 nm), flexible filaments or composites of both [Low, 1996]. 

Thin aggregative fimbriae: Thin aggregative fimbriae (TAF; SEF17) 
from S. enteritidis are thin fibers (3 to 4 nm in diameter) expressed by cells in stationary 
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phase and under conditions of starvation or low osmolarity [Collinson, 1991]. They 
are extremely aggregative, mediate fibronectin binding [Collinson, 1993], and their 
expression confers a rough aggregative colony morphology. These fimbriae are 
characterized as being extremely stable structures; they are not solublized in 5M NaOH 
5 or in boiling 0.5% deoxycholate, 8M urea, or 1 to 2% sodium dodecyl sulfate (SDS) 
with or without 5% (5-mercaptoethanol [Collinson, 1991]. Furthermore, these fimbriae 
remain structurally intact after digestion with proteinase K for up to 2 hours [Collinson, 
1999]. Thus far, depolymerization of the fimbriae and release of the fimbrin subunit 
proteins has only been accomplished by exposing the fimbriae to 90% formic acid 
10 [Collinson, 1991;Collinson, 1999]. To our knowledge, this is the only fimbrial type to 
require such harsh conditions for depolymerization and are thus considered super-stable 
fimbriae. 

Fimbriae as Carriers of heterologous epitopes 

Fimbriae are proteinaceous, filamentous cell surface structures primarily 

15 composed of helically arranged, identical, protein subunits called fimbrins. There are 
up to 1000 fimbrins per fimbriae and as many as 500 fimbriae per cell. They are found 
on a wide variety of pathogenic bacteria and are thought to be involved in the 
colonization of host cell surfaces [Hultgren, 1993]. Therefore, fimbriae may be 
important for the disease process in vivo. 

20 Advantages of a fimbrial presentation system are the following: 1) the 

heterologous antigens will be presented in high numbers (up to 500,000 copies per cell), 
2) the hybrid fimbrin protein may possess both the immunogenicity and adhesion 
properties relevant for an efficient live vaccine, 3) the carrier fimbrial subunit proteins 
are usually strong immunogens, which may be important for directing an immune 

25 response against the inserted epitope, and 4) hybrid fimbriae are easy and inexpensive 
to purify in large amounts (could be used as a vaccine themselves) [Pallesen, 1994]. 
These advantages make a fimbrial expression system very attractive. 

Several strains of £. coli produce fimbriae named curli which are highly 
related to thin aggregative fimbriae from Salmonella spp. Both TAF and curli are 

30 primarily composed of one major subunit protein, AgfA and CsgA, respectively. The 
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primary sequence of AgfA and CsgA are 74% identical and 86% conserved [Collinson, 
1996]; no other characterized fimbrial proteins in existing sequence databases have 
notable sequence similarity to either protein. Curli have also been extensively 
characterized [Bian, 1997; Hammar, 1995; Hammar, 1996; Olsen, 1993; Romling, 
5 1998; Romling, 1998]; the operon encoding curli production has been sequenced and 
characterized in E. coli [Hammar, 1995] and was recently shown to be conserved 
(identity of 78%) in Salmonella typhimurium [Romling, 1998], which also produces 
thin aggregative fimbriae. This evidence indicates that thin aggregative fimbriae and 
curli are both members of the same distinct class of fimbriae. The gene for TAF, agfA 

10 (csgA) has been detected in 99.8% (603/604) of Salmonella isolates, but is less 
representative in other members of the Enter obacteriaceae, including only 19.0% 
(26/137) E. coli strains [Doran, 1993]. agfA is therefore genotypic of Salmonella spp. 
and occasionally found in E. coli and its subspecies, suggesting its origins were in 
Salmonella spp. and coopted into E. coll 

15 AgfA is characterized as having a two-domain structure with a 22 

residue protease-susceptible N-terminal region and a 109 residue protease-resistant C- 
terminal core region [Collinson, 1999]. This C-terminal core region has five-fold 
sequence homology as represented in Fig 5B and is predicted to comprise a primarily 
(}-sheet structure. The first 18 amino acids of each of the five segments conformed to 

20 the consensus sequence Sx 5 QxGx 2 NxAx 3 Q separated by 4 or 5 additional residues 
except for the terminal repeat which ended in Y. Analysis of this five-fold sequence 
homology has yielded a hypothetical, parallel |3~helix, three-dimensional structure for 
AgfA [Collinson, 1999] and this model is displayed in Figure 5C. 

AgfB is proposed to be a minor component of thin aggregative fimbriae 

25 and shares high secondary structure similarity to AgfA. It is the same size as AgfA and 
the overall primary sequence homology is 22% identity and 42% similarity. The 
majority of conserved residues keep the consensus sequence intact (described above for 
AgfA); based on this, AgfB is proposed to have a similar three dimensional structure. 

Assembly of thin aggregative fimbriae: The majority of fimbrial 

30 assembly systems in Gram negative bacteria follow the chaperone/usher pathway 
[Hultgren, 1996]. In this system, the chaperone protein facilitates the release of fimbrin 
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subunits from the cytoplasmic membrane, prevents premature aggregation and targets 
the subunits to the usher protein in the outer membrane which allows them to pass 
through and assemble into fimbrial fibers [Thanassi, 1998]. The prototypical fimbriae 
for this system are the Pap or P fimbriae from E. coli and Type 1 fimbriae from E. coli 
5 and Salmonella [Hultgren, 1996]. Another well characterized assembly system is that 
of Type IV fimbriae produced by Pseudomonas aeruginosa, Neisseria spp., Moraxella 
bovis, Dichelobacter nodosus, and Vibrio cholerae. These fimbriae apparently bypass 
the need for specialized chaperone and usher proteins and utilize a branch of the general 
secretion pathway (type II) which is widely used for the secretion of many other 

10 extracellularly targeted proteins [Salmond, 1996]. Many homologues to the fimbrial 
types described above have been characterized; almost without exception, growth of the 
fimbrial fiber is achieved from the base by addition of the subunits from the periplasmic 
side of the outer membrane. 

In contrast, thin aggregative fimbriae (curli) are proposed to assemble 

15 via a unique and novel mechanism described as extracellular, nucleator-dependent 
assembly [Hammar, 1996]. In this model, AgfA monomers are secreted from the cell 
(across the outer membrane) in a soluble, polymerization-competent form and are 
proposed to polymerize into insoluble fimbrial fibers on the cell surface upon 
interaction with a cell-bound nucleator. AgfB is proposed to function as the nucleator, 

20 inducing a conformational change in soluble AgfA leading to polymerization [Bian, 
1997]. Once an initial interaction has occurred between AgfB and an AgfA monomer, 
the extension of the fiber is probably self-driven with the addition of AgfA monomers 
to the free, distal end. AgfB is also proposed to be a minor component of the fibers. 
Several other proteins are involved in biogenesis of TAF; AgfG may perform a 

25 chaperone-like function to protect AgfA and AgfB from proteolytic degradation or may 
form a multimeric channel in the outer membrane to allow passage of AgfA or AgfB 
[Loferer, 1997], AgfD acts as a positive transcriptional regulator for TAF expression 
[Hammar, 1995], AgfE is proposed to be involved in Congo red and fibronectin binding 
and AgfF is proposed to be involved in nucleation [Romling, 1998]. 

30 The conformational change proposed to occur in AgfA might involve a 

conversion from a partially disordered structure in the monomeric state to readily 
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ordered secondary structures in the polymeric state as has been observed with assembly 
of the bacterial flagellar filament [Aizawa, 1990]. TAF assembly would be driven then 
by conformationally altered AgfA which could bind the next soluble AgfA monomer 
and induce a similar conformational change. Repetition of this process would result in 

5 the formation of the long-insoluble fibers observed with TAF and bears similarity to the 
scrapie protein changing from soluble to insoluble upon formation of rod-like prion 
superstructures [Cohen, 1994]. This assembly model is unique among bacterial surface 
organelles characterized to date. 

Epitope: An epitope refers to an immunologically active region of an 

10 immunogen (protein) that binds to specific membrane receptors for antigen on 
lymphocytes or to secreted antibodies [Kuby, 1994]. To generate an immune response 
to a foreign antigen, lymphocytes and antibodies recognize these specific regions 
(epitopes) of the antigen rather than the entire molecule. 

B cell epitope: The region of an immunogen (protein, polysaccharide, or 

15 lipid) which is recognized by B cells when it binds to their membrane bound antibody. 
The B cells which recognize that particular region then proliferate and secrete antibody 
molecules which are specific for that region of the immunogen. B cell epitopes tend to 
be highly accessible regions on the exposed surface of the immunogen. Stimulation of 
the immune system by B cell epitopes results in "humoral" immunity. 

20 T cell epitope: The region (epitope) of an immunogen which is 

recognized by a receptor on T cells after being processed and presented on the surface 
of an antigen presenting cell (APC) in the context of a major histocompatability 
complex (MHC) class I or II molecule. T cells can be split into two distinct groups, T 
helper cells (T h ) and T cytotoxic cells (T c ). T helper cells recognize epitopes bound to 

25 MHC class II molecules whereas T cytotoxic cells recognize epitopes bound to MHC 
class I molecules. T helper cells can be further subdivided into two classes, T hl and T h2? 
Thl being responsible for stimulation of cell-mediated immunity and Th2 cells 
stimulating the humoral arm of the immune system. When a given T cell recognizes 
the epitope-MHC complex at the surface of the APC it becomes stimulated and 

30 proliferates, leading to the production of a large number of T cells with receptors 
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specific for the stimulating epitope. Stimulation of the immune system by T cell 
epitopes normally results in "cell-mediated" immunity. 

Chromosomal gene replacements: This refers to the replacement of a 
wild-type gene sequence in the bacterial chromosome with an in v/Yro-altered 
5 recombinant gene sequence. 

Attenuated Bacterial Vaccine: This refers to bacterial strains which have 
lost their pathogenicity while retaining their capacity for transient growth within an 
inoculated host. Because of their capacity for transient growth, such vaccines provide 
prolonged immune-system exposure to the individual epitopes on the attenuated 

10 organisms, resulting in increased immunogenicity and memory-cell production, which 
sometimes eliminates the need for repeated booster injections. The ability of many 
attenuated vaccines to replicate within host cells makes them very suitable to induce a 
cell-mediated immunity. Typically, bacterial strains are made attenuated by introducing 
multiple defined gene mutations into the chromosome thereby impairing growth in vivo. 

15 Recombinant Vector Vaccine: This refers to the introduction of genes 

(or pieces of genes) encoding major antigens (or epitopes) from especially virulent 
pathogens into attenuated viruses or bacteria. The attenuated organism serves as a 
vector, replicating within the host and expressing the gene product of the pathogen. 

Sequence Identity: Identity between two nucleic acid sequences, or two 

20 amino acid sequences is expressed in terms of the level of identical residues shared 
between the sequences. Sequence identity is typically expressed in terms of percentage 
identity; the higher the percentage, the more similar the two sequences are. 

Sequence Similarity: Similarity between two amino acid sequences is 
expressed in terms of the level of sequence conservation, including shared identical 

25 residues and those residues which differ but which share a similar size, polarity, charge 
or hydrophobicity. Sequence similarity is typically expressed in terms of percentage 
similarity; the higher the percentage, the more similar the two sequences are. 

Recombinant: A recombinant nucleic acid is one that has a sequence that 
is not normally occurring or has a sequence that is made by an artificial combination of 

30 two otherwise separated segments of sequence. This artificial combination is often 
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accomplished by chemical synthesis or, more commonly, by the artificial manipulation 
of isolated segments of nucleic acids, e.g., by genetic engineering techniques. 

Oligonucleotide (oligo): A linear polymer sequence of up to about 100 
nucleotide bases in length. 

5 Probes and primers: Nucleic acid probes and primers may readily be 

prepared based on the amino acid and DNA sequence provided by this invention. A 
probe comprises an isolated nucleic acid attached to a detectable label or reporter 
molecule. Typical labels induce radioactive isotopes, ligands, chemiluminescent 
agents, and enzymes. Methods for labeling and guidance in the choice of labels 

10 appropriate for various purposes are discussed, e.g., in Sambrook et al. [Sambrook, 
1989]. 

Primers are short nucleic acids, preferably DNA oligonucleotides 15 
nucleotides or more in length. Primers may be annealed to a complementary target 
DNA strand, and then extended along the target DNA strand by a DNA polymerase 
15 enzyme. Primer pairs can be used for amplification of a nucleic acid sequence, e.g., by 
the polymerase chain reaction (PCR) or other nucleic-acid amplification methods 
known in the art. 

Methods for preparing and using probes and primers are described, for 
example, in Sambrook, 1989, Ausubel, 1987, and Innis, 1990. PCR primer pairs can be 

20 derived from a known sequence, for example, by using computer programs intended for 
that purpose such as DNAStar Lazergene software. One of skill in the art will 
appreciate that the specificity of a particular probe or primer increases with its length. 
Thus, for example, a primer comprising 20 consecutive nucleotides will anneal to a 
target with a higher specificity than a corresponding primer of only 15 nucleotides. 

25 Thus, in order to obtain greater specificity, probes and primers may be selected that 
comprise 20, 25, 30, 35, 40, 50 or more consecutive nucleotides. 

Isolated: An "isolated" biological component (such as nucleic acid or 
protein or organelle) has been substantially separated or purified away from other 
biological components in the cell of the organism in which the component naturally 

30 occurs, i.e., other chromosomal and extra-chromosomal DNA and RNA, proteins and 
organelles. Nucleic acids and proteins that have been "isolated" include nucleic acids 
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and proteins purified by standard purification methods. The term also embraces nucleic 
acids and proteins prepared by recombinant expression in a host cell as well as 
chemically synthesized nucleic acids. An "isolated" bacterial strain or colony is 
purified away from other colonies and yields a pure culture without any contaminants 
5 upon plating on selective media. 

Vector: A nucleic acid molecule as introduced into a host cell, thereby 
producing a transformed host cell. A vector may include nucleic acid sequences that 
permit it to replicate in a host cell, such as an origin of replication. A vector may also 
include one or more selectable marker genes and other genetic elements known in the 

10 art. A "temperature-sensitive" vector is one which replicates normally at a low growth 
temperature (i.e., 28°C) and will not replicate at a higher growth temperature (i.e., 
42°C) due to mutations at or near the origin of replication. An "imperfectly 
segregating" vector is one which is not stably inherited by new daughter cells at the 
time of cell division in the absence of selection pressure due to mutations within the 

15 vector sequence. 

Operon: An operon is the term used to describe a DNA sequence that 
codes for one or more polypeptides (structural genes), usually of related function, and a 
DNA sequence that regulates the expression of these genes. 

Selection of Salmonella and thin aggregative fimbriae 

20 Salmonella spp. are well developed vaccine vectors [Hackett, 1993]. 

Attenuated Salmonella strains can elicit protective immunity and induce secretory, 
humoral and cellular anti-Salmonella responses in hosts following oral immunization 
[Levine, 1996]. In addition, most Salmonella spp. are facultative intracellular 
pathogens [Fields, 1986] with a highly characterized invasion pathway [Galan, 1996] 

25 and can express antigens inside of host cells. These features, when combined with the 
ease of genetic manipulation in Salmonella spp., makes these facultative intracellular 
pathogens excellent candidates as vaccine vectors for the presentation of protective 
heterologous antigens [Curtiss III, 1994], 

Thin aggregative fimbriae were selected as the carriers of heterologous 

30 sequences because they represented a novel, super-stable class of fimbriae among the 



19 



enterobacteriacea [Low, 1996], They can be expressed at high levels and are surface- 
exposed [Collinson, 1991]. The biochemical and immunological characteristics of the 
major subunit protein, AgfA, have been well characterized [Collinson, 1991; Collinson, 
1992; Collinson, 1993] and most of the genes in the operon required for assembly have 
5 been identified [Hammar, 1995; Collinson, 1996; Romling, 1998]. The structural gene, 
agfA, of TAF was found to be widely distributed and is probably common among 
Salmonella spp. All of these factors made TAF an ideal choice for testing as a carrier of 
heterologous sequences. 

Selection of the PT3 epitope 

10 Several immunoprotective T cell epitopes have been identified from the 

GP63 protein of Leishmania major, and were found to stimulate the proliferation of 
lymphocytes from mice immunized with whole GP63 [Jardim, 1990]. One of these 
epitopes, PT3, comprising residues 154-168 of the mature protein, specifically induced 
proliferation of CD4 + T cells of the T hl subset, known to be involved in the 

15 development of cell-mediated immunity [Jardim, 1994]. A further study proved that a 
single immunization with synthetic PT3 in adjuvant was able to protect Balb/c mice 
against challenge with Z. major promastigotes [Spitzer, 1998]. Furthermore, studies 
with Salmonella vaccine strains expressing GP63 from an extrachromosomal vector 
have demonstrated that oral vaccination against Leishmania infection is possible 

20 [McSorley, 1997]. Therefore, PT3 is a good candidate for developing and testing a 
heterologous fimbrial vaccine system in Salmonella. 

Method of gene replacement within the Salmonella enter itidis agfBAC operon 

Several methods of chromosomal gene replacement have been developed 
for the creation of Salmonella vaccine vectors (discussed above). However, these 
25 methods insert the recombinant genes into the chromosome at non-native sites. As a 
consequence, expression of the recombinant genes would be predicted to be altered. 
There is a need for a method of chromosomal gene replacement in Salmonella to insert 
recombinant genes into native regions normally occupied by the wild-type genes. Two 
such methods exist for chromosomal gene replacements in E. coli. The method by 
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Hamilton et al. (1989) relies on the use of a temperature sensitive pSC 101 -derived 
vector to perform the replacements. Gene replacement by this method does not leave 
(or add) any extraneous DNA elements and one drawback is having to cure final strains 
of freely replicating vectors. A more recent method proposed by Link et al* (1997) uses 
5 a vector related to the one used by Hamilton et al. (1989) with the addition of the sacB 
selectable marker to the vector to allow for selection for loss of the vector sequence 
upon gene replacement. 

The method of chromosomal gene replacement according to the present 
invention also uses a temperature-sensitive pSC 101 -derived vector, pHSG415 
10 [Hashimoto-Gotoh, 1981], but has some notable differences. 

Generation of recombinant S. enteritidis fimbrin genes. 

The S. enteritidis 3b agfA fimbrin gene was chosen to receive a site- 
specific epitope replacement with the PT3 epitope. The two-step overlap extension 
PCR method of Horton et al. [Horton, 1989] was used (Figure 1A, B) to replace 48 bp 
15 of the fimbrin gene with 48 bp encoding the PT3 T cell epitope from L. major (Figure 
1C). The PCR-generated gene encoded an AgfA protein carrying the PT3 epitope 
within a specific region (residues 111-126 in AgfA). The resultant agfA::PT3 PCR 
products were sequenced and cloned into pHSG415 (Figure 2A) to generate the 
recombinant temperature-sensitive vector pHSAP7. 

20 Gene Replacement in S. enteritidis. 

S. enteritidis 3b cells were transformed with pHSAP7 and grown 24h in 
liquid media (Terrific broth with Ampicillin; TFB/Ap). This culture was transfered five 
consecutive times into fresh media for a total growth period of 120 h before plating on 
LB/Ap. 50 isolated colonies were picked and 100% were identified by PCR to contain 

25 the integrated vector sequence. Four pHSAP7 cointegrate colonies were selected and 
grown at the permissive temperature of 28°C in liquid media (TFB) without Ap 
selection pressure for 24 h and tranferred five consecutive times into fresh media for a 
total growth period of 120 h at 28°C. Colonies were plated onto LB and LB/Ap plates 
to determine the percentage of Ap s colonies. 184 isolated colonies were picked from 
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each cointegrate culture, yielding 27%, 15%, 10% and 9% Ap s from the four pHSAP7 
cointegrates as shown in Table 1 . 

5 Table L Efficiency of agfA chromosomal gene replacement in & enteritidis. 

Plasmid cointegrate Ap s colonies 8 Chimeric gene 

colonies (%) # % replacements 

pHSAP7 

Isolate #2 18 10 7 b 

Isolate #3 28 15 10 b 

Isolate #9 49 27 4 b 

Isolate #10 17 9 8 b 
a 184 individual colonies picked in total. 

b Frequency of agfA::PT3 containing S. enteritidis estimated by CR binding morphology (TCR). 



10 Genetic analysis of S. enteritidis strains containing recombinant agfA::PT3. 

To identify S. enteritidis strains containing the chimeric agfA::PT3 gene, 
five of the final Ap s colonies were chosen and were analyzed by PCR. Three strains 
were identified as having the PT3 DNA sequence within agfA and all five strains were 
negative for the pSClOl ori sequence as above. PCR products from each strain were 

15 the same size as PCR products of agfA from S. enteritidis, indicating that there was no 
additional plasmid sequence inserted in this region. Final confirmation of the genotypes 
of these strains was obtained by DNA sequencing proving that three strains contained 
agfA::PT3 and two strains contained wild-type agfA. Thus, these data confirmed the 
creation of S. enteritidis strains containing chimeric sefA::PT3 or agfA:;PT3 fimbrin 

20 genes by precise recombination events. 

The existing methods of chromosomal gene replacement are less 
favorable than the present method. The method of Hamilton et al [Hamilton, 1989] 
results in final strains containing freely replicating plasmids which must be cured before 
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selecting gene replacements, whereas the method of the invention results in a high 
proportion of final strains which have lost the excised plasmid altogether and can be 
screened directly by PCR. The method of Link et al [Link, 1997] includes the Bacillus 
subtilis sacB system to enable direct selection for excision and loss of integrated 

5 plasmids. However, when creating deletion mutants without the use of selectable 
markers, these authors reported lower frequencies than are reported here (Table 1). A 
recent study by Edwards et al [Edwards, 1998] reported the replacement of sefA or 
agfA within S. enteritidis at lower frequencies than reported here, even with the use of a 
Km R marker in the recombinant genes. This supports the usefulness of the present 

10 method and demonstrates that the high replacement frequencies reported here are not 
simply due to the ease of sefA or agfA replacement. Therefore, this method has broad 
application and appeal for gene replacement in Salmonella and other enteric bacteria 
capable of supporting the replication of pSCl 01 -derived plasmids. In addition, the 
present results suggest that the use of similar "unstable" plasmids in other organisms 

1 5 could represent a more generalized mutagenesis strategy. 

A closer look at the properties of pHSG415 can help explain why this 
gene replacement method is so efficient. In contrast to pSClOl, pHSG415 was shown 
to be extremely unstable in liquid culture in the absence of selection pressure, with as 
few as 60% of cells carrying the plasmid after 24 hr [Caulcott, 1987]. These authors 

20 speculated that the difference between pSClOl and pHSG415 was due to the absence of 
most of the par locus in pHSG415. The pSClOl par locus has been very well 
characterized; it has been shown to alter the binding of proteins within the origin region 
[Ingmer, 1993] and thus to enhance DNA replication [Manen, 1990]. In addition, it 
contains a preferential binding site for DNA gyrase [Wahle, 1988] and alters the 

25 negative supercoiling of DNA [Miller, 1990]. Through a combination of these effects 
the par locus promotes partitioning of plasmids to daughter cells at the time of cell 
division [Meacock, 1980]. Due to the deletion of the par locus, pHSG415 is proposed 
to have a defect in its segregating properties causing unequal partitioning of plasmid 
molecules to daughter cells at the time of cell division [Caulcott, 1987]. Although the 

30 present invention is not bound by a specific mechanisms,, it is proposed that the present 
method uses the unstable properties of pHSG415 to advantage, and in the absence of 



23 



selection pressure, integrated plasmids are lost immediately after the second crossover 
event or are retained for several generations and then lost. Thus, a replacement of the 
agfA gene in a wild-type strain of S. enteritidis was performed at an efficiency of 10% 
without any accompanying antibiotic resistance markers in the target genes. 

5 Specific DNA sequence replacements within agfA 

Expression of Heterologous Epitopes in E. coli Fimbriae 

Several E. coli fimbrial types have been utilized for the presentation of 
heterologous sequences. The fimbriae used as carriers thus far have been type 1 
fimbriae [Hedegaard, 1989; Stentebjerg-Olesen, 1997], K88 fimbriae [Bakker, 1990; 

10 Thiry, 1989], P fimbriae [van der Zee, 1995; van Die, 1990], CS31A fimbriae [Der 
Vartanian, 1997] and type 4 fimbriae [Jennings, 1989]. All were chosen because their 
expression systems and biochemical properties were relatively well characterized in 
comparison to other fimbrial types of E. coli. 

In summary, foreign antigenic determinants (epitopes) from various 

15 organisms were tested for insertion or replacement into the major structural subunit 
protein of the fimbriae, such as: epitopes from the hepatitis B virus surface antigen, 
foot-and-mouth disease virus (FMDV), Poliovirus type 1, cholera toxin B, 
Mycobacterium leprae, Plasmodium falciparum, Transmissible gastroenteritis virus 
(TGEV), human influenza virus, Neisseria gonorrhoeae fimbrin subunit as well as the 

20 hormones gonadotropin releasing hormone and somatostatin. Positions for insertion or 
replacement were typically selected based on hydrophilicity profiles of the subunit 
sequence, sequence variability among different serovars and/or linker scanning 
mutagenesis. In each study, the foreign sequence (epitope) was detected using 
antibodies directed against these sequences. In many cases fimbriae were expressed, 

25 but this depended on the size and sequence of the foreign epitope and the position used 
within the subunit protein. These studies have shown that it is possible to construct 
bacterial strains expressing recombinant fimbriae carrying immunologically active 
foreign sequences without seriously affecting the properties of the fimbriae. There were 
some problems with these fimbrial expression systems, however. In each study, 
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expression of the mutant fimbrin genes was directed from a recombinant vector 
transformed into E. coli and the researchers had to reconstitute the expression of the 
fimbriae. Therefore, conditions for expression might not represent the physiological 
conditions in wild-type cells. 

5 No similar studies using Salmonella fimbriae as carriers of heterologous 

antigens have been published to date. This is due in part to the fact that Salmonella 
fimbriae have not been as well characterized as their E. coli counterparts. Flagella, on 
the other hand, have been quite well characterized in Salmonella, Accordingly, studies 
have been done in Salmonella using flagella as carriers of heterologous antigens. The 

1 0 results of these studies are outlined below. 

Expression of Heterologous Epitopes in Salmonella Flagella 

Research into the area ofexpression of epitopes on flagella has been 
previously published [Newton, 1989; Newton, 1991; Newton, 1995; Wu, 1989; Verma, 
1995; Verma, 1995; Cattozzo, 1997]. Flagellin studies have involved the insertion of 

15 various foreign DNA sequences into a single, hypervariable region of the major phase- 1 
flagellin subunit gene of an attenuated strain of Salmonella. Some of the epitopes that 
have been tested in this system are from Cholera toxin B, Hepatitis B surface antigens, 
Streptococcus pyogenes M protein, HIV-1, Influenza A hemagglutinin, Plasmodium 
spp., Rotavirus, Corynebacterium diptheriae, Listeria monocytogenes listeriolysin and 

20 Moth cytochrome c. Like the fimbrial systems described above, production of 
assembled flagella on the surface of the Salmonella cells was dependent on the 
sequence and length of the foreign epitope. Many of the studies tested the 
immunogenicity of the heterologous epitopes inserted in Salmonella flagellin by 
injecting the whole strain into mice, rabbits or guinea pigs; almost all immunized 

25 animals developed an Ab response to the epitope, proving that these sequences were 
still antigenic when expressed in flagella. Other studies showed that cell-mediated 
immunity could be obtained also [Verma, 1995; Verma, 1995] although this may 
depend on the particular epitope tested [Vanegas, 1997]. Finally, purified recombinant 
flagella can elicit humoral and cell-mediated immune responses when administered 

30 alone [Levi, 1996]. 



25 

The data obtained from these flagella studies indicate that an epitope can 
retain its immunogenicity even when expressed in the context of an unrelated protein. 
They also prove the effectiveness of Salmonella as a live attenuated vaccine. However, 
there are some problems, the major one being the relatively low expression levels of 
5 flagella on bacterial cells (<10 copies per cell) compared to high-expression levels of 
fimbriae (100-1000 per cell) or other cell-surface organelles. 

AgfA as a carrier of the PT3 epitope 

Ten different 16 amino acid segments within AgfA were chosen for 
replacement with the PT3 epitope and are highlighted in Figure 5B and 5C. These 

10 replacement regions were chosen using several criteria: primary sequence alignment, 
regions Al and A2 (red); protease-susceptibility data [Collinson, 1999], region A3 
(black; not represented in Figure 5C); secondary structure predictions matching the 
p-sheet half of PT3 with p-sheet regions in AgfA, regions A4-A7 (blue); and 
secondary structure predictions matching the a-helical half of PT3 with p-sheet regions 

15 of AgfA, regions A8-A10 (yellow). In total, all 131 residues within AgfA, except 6 in 
the N-terminal region and 8 at the extreme C-terminus, were replaced with the PT3 
sequence in at least one of the chimeric fimbrin constructs. 

Generation of S. enteritidis strains containing the asfA::PT3 genes 

All recombinant agfA::PT3 genes were generated by PCR, sequenced 
20 and introduced into the chromosome of S. enteritidis, replacing the W* agfA gene, using 
the procedure described above. All bacterial strains used are outlined below in Table 2. 

Table 2. S. enteritidis strains used in this study. 



Strain Description and relevant genotypes 

3b wild-type 

2-2a agfAr.TnphoA fusion in chromosome 

A+ similar to wild-type 

A 1 agfA : :PT3# 1 in chromosome 
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A2 agfA::PT3#2 in chromosome 

A3 agfA::PT3#3 in chromosome 

A4 agfA : :PT3#A in chromosome 

A5 agfA::PT3#5 in chromosome 

A6 agfA::PT3#6 in chromosome 

A7 agfA::PT3#7 in chromosome 

A8 agfA::PT3#% in chromosome 

A9 agfA::PT3#9 in chromosome 

A10 agfA::PT3#\0 in chromosome 



Western blot analysis of £ enter itidis strains containing agfA::PT3 in the chromosome. 

The recombinant S. enteritidis strains A1-A10 were grown on T plates at 
37°C for 24h and were analyzed for production of recombinant AgfA proteins 
5 containing the PT3 epitope. Replicate Western blots are shown in Figure 2A and 2B, 
reacted with either anti-TAF or anti-PT3 immune sera. All 10 chimeric strains were 
expressing their corresponding recombinant AgfA::PT3 proteins, although strains Al 
and A10 (Figure 6, lanes 3 and 14) expressed much lower levels. All chimeric fimbrin 
proteins differed from AgfA in that they could be detected to some degree on Western 
10 blots without formic acid pre-treatment. Proteins from strains A5 ? A6, A8, and A9 
(Figure 6 A lanes 7, 10, 12, and 13, respectively) appear smaller than purified AgfA 
(17kDa; Figure 6A lanes 1 and 9). The nature of these size or moblility differences was 
not investigated. These data confirmed the expression of the 10 chimeric AgfA::PT3 
proteins from the chromosome of S. enteritidis. 

15 Immunogold electron microscopy of S. enteritidis strains containing agfA::PT3. 

To determine whether the recombinant AgfA fimbrins expressed by S. 
enteritidis strains A1-A10 were being assembled into fimbrial fibers, immunogold 
labeling was performed with immune sera against TAF or PT3. Recombinant fimbrial 
fibers were observed on the surface of cells from strains A2-A9 in 8 out of 10 strains 

20 (representative EMs are shown in Figure 3). These fibers were distinct with definite 
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cell surface contact points and looked very similar to W + TAF from S. enter itidis 3 b 
(Figure 3), although the number of chimeric fimbrial fibers was slightly reduced in all 
cases. Strains A3, A4, A5 and A8 were judged to be the most similar to W + S. 
enteritidis 3b in terms of number of fibers per cell, the frequency of labelled cells, and 
5 the distinctness of the individual fibers. In contrast to the other S. enteritidis strains, Al 
and A10 did not display consistent labeled fimbrial fibers at the cell surface, although 
some individual cells did show labeling. Thus, these results indicated that 8 of the 10 
recombinant AgfA::PT3 proteins were efficiently assembled into fimbrial fibers 
displayed at the cell surface of S. enteritidis, 

10 Since growth on solid media is a relatively static environment, the 

production of recombinant fimbrial fibers in broth culture was examined to determine 
whether they were stably attached to the S. enteritidis cell surface. Immunogold 
electron microscopy was performed as above on cells grown at 37°C for 24h in T broth; 
representative EMs are shown in Figure 3. These results were very similar to those 

1 5 observed for cells grown on T plates, although more non-cell-associated fimbrial fibers 
were observed on the T broth grids. Again, 5. enteritidis strains A2-A9 had significant 
amounts of cell-associated recombinant fimbrial fibers with strains A3, A4, A5, and A8 
being the most similar to W + S. enteritidis 3b. However, like the T plate samples, A2- 
A9 were judged to have slightly fewer and less distinct fimbrial fibers than W + . No 

20 consistent cell-associated chimeric fimbrial fibers were observed for S. enteritidis 
strains Al and A10. These results confirmed that 8 of the 10 AgfA::PT3 fimbrin 
proteins were efficiently assembled into recombinant fimbrial fibers present at the cell 
surface of S. enteritidis and proved that their attachment to the cell surface was 
relatively stable. These results are further summarized in Table 3. 

25 

Table 3. Properties of S. enteritidis strains containing recombinant agfA. 





Protein Size 


(kDa) 


T Plates 


T Broth 


Strain 


Predicted 


Experimental 


+FA 


Fibers 


Fibers 


3b 


13.4 


17 


+++ 


+++ 


+++ 


2-2a 




>60 


++ 






Al 


13.6 


17.5 


+ 


+/- 




A2 


13.7 


17 


+++ 


++ 


++ 
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n 7 


17 

1 / 


-h-h -H- 


++ 


A4 


n 6 


17 


-h++ +-h 


++ 




1 J.O 


14 S 


+++ ++ 


+-h 




1 J. J 


1 5 5 


++ ++ 




A7 


13.7 


17 


++ + 




A8 


13.5 


15.5 


++ ++ 


++ 


A9 


13.6 


14.5 


++ -Hh 


-h-h 


A10 


13.4 


17 


+/- 


+/- 


A+ 


13.4 


17 


-h-h-h -H-+ 


-h++ 



Resistance of the recombinant TAF fibers to proteinase K digestion. 

Although the recombinant fimbrial fibers looked similar to W + TAF, it 
further analysis was conducted to determine if they retained some of their stable 
5 properties. Western blot analysis of the S. enteritidis strains A1-A10 after proteinase K 
digestion for 1 or 2 hr is represented in Figure 5A and 5B. Most of the chimeric 
fimbrial fibers were resistant to proteinase K, with all strains except A6, A9, and A10 
(Figure 5, lanes 10, 13, and 14) displaying immunoreactive protein bands after 2hr of 
digestion. The immunoreactive band observed for strain Al (Figure 5, lane 3) 

10 suggested that this strain might also possess a small number of polymerized chimeric 
fimbrial fibers even though they were not consistently present at the cell surface when 
viewed by EM. As expected, AgfA from S. enteritidis 3b or the A + control strain 
remained intact (Figure 5, lanes 8 and 15), the dimer demonstrating the two domain 
structure of AgfA [Collinson, 1999]. Multiple immunoreactive bands were observed 

15 for strains A4, A5, A7 (Figure 5, lanes 6, 7, and 1 1). These results indicated that most 
of the chimeric fimbrial fibers produced in S. enteritidis possessed a stable, ordered 
superstructure that was protease resistant, very similar to W + or mature thin aggregative 
fimbriae. 
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10 Example 1 

Structural Predictions of AgfA, the Insoluble Fimbrial Subunit of 
Salmonella Thin Aggregative Fimbriae 
The unusually stable and multifunctional, thin aggregative fimbriae 
common to all Salmonella spp. are principally polymers of the fimbrin subunit, AgfA. 

15 AgfA of Salmonella enteritidis consisted of two domains: a protease-sensitive, 22 
amino acid N-terminal region and a protease-resistant, 109 residue C-terminal core. 
The unusual amino acid sequence of the AgfA core region comprised 2-, 5- and 10-fold 
internal sequence homology patterns reflected in 5 conserved, 18-residue tandem 
repeats. These repeats had the consensus sequence, Sx 5 QxGx 2 NxAx 3 Q and were linked 

20 together by 4 or 5 residues, (x)xAx 2 . The predicted secondary structure for this unusual 
arrangement of tandem repeats in AgfA indicated mainly extended conformation with 
the p-strands linked by 4 to 6 residues. Candidate proteins containing motifs of 
alternating (3-strands and short loops were selected from folds described in SCOP as a 
source of coordinates for AgfA model construction. Three all-p class motifs selected 

25 from the Serratia marcescens metalloprotease, myelin P2 protein or vitelline membrane 
outer protein I were used for initial AgfA homology build-up procedures ultimately 
resulting in three structural models, (3 barrel, p prism and parallel p helix. The p barrel 
model suggested a compact, albeit irregular structure, with the P-strands arranged in 
two antiparallel p-sheet faces. The p prism model did not reflect the five- or ten-fold 

30 symmetry of the AgfA primary sequence. The favored, parallel p helix model was a 
compact coil of ten helically arranged p-strands forming two parallel p -sheet faces. 
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This arrangement predicted a regular, potentially stable, C-terminal core region 
consistent with the observed tandem repeat sequences, protease-resistance and strong 
tendency of this fimbrin to oligomerize and aggregate. Positional conservation of 
amino acid residues in AgfA and the E. coli AgfA homologue, CsgA, provided strong 

5 evolutionary support for this model. The parallel P helix model of AgfA offers an 
interesting solution to a multifunctional fimbrin molecular surface having solvent 
exposed areas, regions for major and minor subunit interactions as well as fiber-fiber 
interactions common to many bacterial fimbriae. 

Fimbriae and pili are terms, often used interchangeably, for fine, hairlike 

10 structures that protrude from the surface of many bacteria (Firth et ah, 1996; Low et ah, 
1996; Mol & Oudega, 1996; Tennent & Mattick, 1994). These ubiquitous fibres have 
received considerable attention mainly due to their pivotal role in facilitating bacterial 
adherence in diverse circumstances including host tissue colonization (Gaastra & de 
Graaf, 1982; Low et ah, 1996; Tennent & Mattick, 1994), bacterial cell-cell interactions 

15 (Marceau et ah, 1995; Whittaker et ah, 1996), conjugal transfer of DNA including 
exchange of antibiotic resistance genes (Firth et ah, 1996; Silverman, 1997) or transfer 
of genetic information into susceptible hosts (Fullner et ah, 1996; Lai & Kado, 1998). 
Many diverse fimbrial and pili systems have been studied in detail to understand the 
molecular basis for fiber function, assembly and regulation. This information has 

20 furthered the understanding of fundamental physiological phenomena including protein 
processing, transport and organelle assembly (de Graaf & Bakker, 1992; Firth et ah, 
1996; Hung et ah, 1996, Kuehn, 1994; Leathart & Gaily, 1998; Low et ah, 1996; Mol 
& Oudega, 1996; Smyth et ah, 1996; Strom & Lory, 1993). In addition, these studies 
have furnished biotechnological advances in disease detection and prevention in the 

25 form of fimbrial-based diagnostics and therapeutics, fimbrial vaccine formulations and 
passive immunization methodologies (Abraham et ah, 1985; Der Vartanian et ah, 1994; 
Doran et ah, 1993; Levine et ah, 1994; Ogawa et ah, 1997; Pallesen & Klemm, 1994; 
Thorns et ah, 1994). 

Fimbriae are complex polymers, 2 to 9 nm wide and several hundred or 

30 thousand nm long. Structurally, they are comprised mainly of a single fimbrin (or pilin) 
protein subunit type polymerized to form the bulk of the fiber (Low et ah, 1996; 
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Paranchych, 1990; Silverman, 1997; Tennent & Mattick, 1994). Additional 
constituents have been identified in many fimbriae (Low et a/., 1996). These include 
minor fimbrin-like proteins required for the fidelity of fiber biogenesis (de Graaf & 
Bakker, 1992; Klemm & Krogfelt, 1994; Kuehn et al., 1994) and/or specialized adhesin 

5 subunits required for specific receptor recognition (Low et al, 1996). In other cases, as 
with K88 fimbriae, receptor binding specificity is an apparent function of the major 
fimbrin subunit itself (de Graaf & Bakker, 1992). 

Low resolution X-ray and optical diffraction studies of laterally 
aggregated or paracrystalline arrangements of thin (2 to 5 nm) and thick (7 to 9 nm) 

10 fimbriae indicate that, in both fiber types, the major fimbrin subunits are helically 
arranged within the fiber (Brinton Jr, 1965; Gong & Makowski, 1992; Heck et al, 
1996; Paranchych, 1990; Silverman, 1997; Simons et ah, 1994). High resolution, 
freeze fracture electron microscopy studies of native and mutant E, coli Pap pili 
ultrastructure revealed that this 7 nm pili polymer terminates with a thin, 2 nm wide, 

15 fibril of open helically arranged minor subunits that display the terminal adhesin 
subunit required for specific binding of the fiber to its respective target receptor 
(Klemm & Krogfelt, 1994; Kuehn et a/., 1992). This complex architecture is shared by 
other thick fimbriae of E. coli including type I, S and Haemophilus pili (Hung et al., 
1996; Jones et al, 1997; St. Geme III et al, 1996). In addition, the structural plasticity 

20 of fimbriae is evident after certain chemical treatments that "unravel" thick fimbriae 
into thin fibrils or "wind" thin fibres into thicker structures (Mol & Oudega, 1996). 
However, the ultrastructure of thin fimbriae is largely unknown. 

The Neisseria gonorrhoea Type IV pilin subunit is the sole fimbrin to 
date to be crystalized and for which a 3-D structure has been detemined (Parge et al, 

25 1995). This novel a-p roll pilin structure was modeled into a helically polymerized 7 
nm fiber based upon the steric considerations and physicochemical surface 
characteristics of the pilin subunits (Parge et ah, 1995). Otherwise, no tertiary 
structural information is available for other fimbrins. The pronounced tendency of 
unassembled or depolymerized fimbrin monomers to aggregate or repolymerize 

30 complicates crystal formation and consequently precludes 3-D structural elucidation. 
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The present Example relates to the structural elucidation of thin 
aggregative fimbriae of Salmonella. Thin aggregative fimbriae are one, of at least 8 
fimbrial types produced by various strains of these important animal pathogens 
(Baumler & Heffron, 1995; Doran et al 9 1993; Low et al 9 1996). The exact role(s) of 

5 thin aggregative fimbriae (3-4 nm) in Salmonella pathogenesis is as yet uncertain 
(Sukupolvi et ah, 1997a), even though these fibres have been shown to bind various 
host proteins (Collinson et ah, 1993; Sjobring et ah, 1994) and promote interaction of 
Salmonella with mouse small intestinal epithelial cells (Sukupolvi et al 9 1997b). Given 
the extreme stability and unusually aggregative nature of these fibres, their importance 

10 to Salmonella pathogenesis may also be related to their role in promoting Salmonella 
autoaggregation (Collinson et al 9 1993; Collinson et al 9 1991). This feature has broad 
implications for host infection (Collinson et aL 9 1991) and Salmonella biofilm 
formation on inert surfaces (Austin et aL 9 1998). 

Virtually all Salmonella spp. tested to date possess the agfA fimbrin 

15 gene (Baumler et al. 9 1997; Collinson et al 9 1996b; Doran et al. 9 1993). Escherichia 
coli also possess an agfA fimbrin homologue, csgA 9 that assembles into fibres named 
curli (Olsen et al 9 1993). Salmonella thin aggregative fimbriae and E. coli curli are 
biochemically (Collinson et al 9 1992; Doran et al 9 1993; Olsen et al 9 1993), 
genetically (Collinson et al. 9 1996a; Romling et aL 9 1998), and functionally (Austin et 

20 al 9 1998; Collinson et al 9 1993; Collinson et al 9 1992; Hammar et al 9 1995; Olsen et 
al 9 1993; Vidal et al 9 1998) analogous. This is consistent with the fact that both 
fimbrin genes are highly conserved homologues from an ancient operon in the 
progenitor of Salmonella and E. coli (Baumler et aL 9 1997). 

Thin aggregative fimbriae were purified by an unconventional procedure 

25 from Salmonella enteritidis (Collinson et al 9 1991). In addition, brief exposure of the 
fimbriae to 90% formic acid (Collinson et al 9 1991) is the only pretreatment known to 
depolymerize these fibres to release the AgfA fimbrin monomers. Since this initial 
characterization, very little biophysical data have emerged for thin aggregative fimbriae 
due to their extreme aggregative nature and the existence of multiple isoforms 

30 (Collinson et aL 9 1991). However, such information is needed to understand the 
molecular basis for the aggregative nature and unusual stability of these fibres as well as 
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to provide the molecular framework for the design of AgfA-based heterologous 
Salmonella vaccines (White et aL, 1999). 

As a novel approach to explore AgfA tertiary structure, homology 
buildup and molecular modeling techniques were applied to the prediction of AgfA 
5 tertiary structure taking into account the intriguing primary amino acid sequence repeat 
motif in AgfA and the discovery of the protease resistant AgfA C-terminal core region. 
As the first modeling study of any bacterial fimbrin or pilin tertiary structure, the initial 
goal was to obtain a working model of AgfA molecular structure for future 
experimental design and testing. 

10 

Materials and Methods 

Purification and depolvmerization of thin aggregative fimbriae, SEF17. 
Thin aggregative fimbriae were isolated and purified from Salmonella enteritidis 
27655-3b grown on solid T medium as previously described (Collinson et al, 1991). 

15 Purified fimbriae samples were routinely treated with 90% formic acid to depolymerize 
the AgfA fimbrin subunits prior to electrophoretic or immunoblot analysis (Collinson et 
al, 1993; Collinson et al, 1991). 

The effect of formic acid concentration on disaggregation and 
depolymerization of thin aggregative fimbriae was determined on fimbrial suspensions 

20 of 0.5 mg/ml. The large clumps of fimbriae were broken using a 2 ml Micro Tissue 
Grinder (VWR Canlab). Homogenized suspensions of fimbriae were aliquoted into 
microfuge tubes and the fimbriae recovered by centrifiigation (15,600 x g, 10 min) 
before being resuspended in formic acid solutions and recording the absorbance at 600 
nm (A 600 ). A subsample of each fimbrial suspension containing approximately 20 |ig of 

25 protein was lyophilized and subjected to electrophoresis and immunoblot analysis. 

SDS-PAGE and immunoblot analysis. SDS-PAGE was performed 
according to the method of Laemmli (Laemmli, 1970). Immunoblot analysis of AgfA 
was performed as previously described (Collinson et aL, 1993; Collinson et aL, 1991) 
using rabbit polyclonal immune serum generated to purified thin aggregative fimbriae. 

30 N-terminal amino acid analyses. N-terminal amino acid sequence 

analyses were performed as previously described (Collinson et al., 1991). 
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Protease treatment of thin aggregative fimbriae. Whole S. enter itidis 3 b 
cells possessing thin aggregative fimbriae or purified thin aggregative fimbriae were 
resuspended in Laemmli sample buffer (Laemmli, 1970), boiled for 10 min, cooled and 
then brought to a final concentration of 0.5 mg/ml proteinase K before incubation for 1 
5 hr at 60°C. Samples were boiled, fimbriae recovered by centrifugation (15,600 x g, 10 
min), depolymerized with 90% formic acid (Collinson et aL, 1991) and then analysed 
by SDS-PAGE or immunoblotting for products of digestion. 

Secondary structural analysis of AgfA . Secondary structure analysis of 
AgfA was determined using several programs including: Alexis version 1.2 SEQSEE 
10 (Wishart et aL, 1994), PPSP (Parker & Hodges, 1991a; Parker & Hodges, 1991b), 
Sequences Annotated by Structure (www.biochem.ucl.ac.uk/cgi-bin/sas), Gibrat 
Secondary Struture Prediction and Hierarchical Neural Network (pbil.ibcp.fr/NPSA, 
NNPredict (www.cmpharm.ucsf.edu/cgi-bin/nnpredict.pl), PhDsec (www.embl- 
heidelberg.de/predictprotein) and Predator (www.embl-heidelberg.de/predator). 
15 Homology alignment and selection of proteins for homology buildup. 

The AgfA amino acid sequence was compared to sequences of proteins with known 
tertiary structures defined by analysis of X-ray crystallography using SEQSEE (Wishart 
et aL, 1994), HomologyPlot (Parker & Hodges, 1991b) and PropSearch (Hobohm & 
Sanders, 1995). Several candidate proteins for structural alignment were chosen by 
20 scanning protein folds described in the Protein Data Bank (PDB) (Bernstein et aL, 
1977) and the program SCOP (Murzin et aL, 1995) to identify several proteins 
containing p-strand structures connected by short loops. Two proteins, the Serratia 
marcescens protease (SMP) (Baumann et aL, 1995) and the C-terminal region of the 
bovine myelin P2 protein (PMP) (Cowan et aL, 1993; Jones et aL, 1988), contained a 
25 repeating pattern of hydrophobic residues analagous to that proposed for AgfA. SMP 
and PMP were chosen as templates for the (5 barrel and parallel (3 helix models, 
respectively (Fig 16 a, b). The (5 prism fold of the vitelline membrane outer layer 
protein I (VMO-I) (Shimizu & Morikawa, 1996; Shimizu et aL, 1994) was chosen as a 
third structural template on the basis of the VMO-I primary sequence alignment with 
30 that of AgfA (Figure 1 6 c). 
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AgfA Parallel p-helix model homology buildup procedure. The parallel 
P-helix structure of AgfA was built using Insight II and Discover programs (BioSym 
Technologies Inc.). Initially, the a-carbon distance restraints of amino acid residues 327 
to 343 of the known x-ray structure for SMP (Baumann, 1994; Braunagel & Benedik, 

5 1990) were used to build the first p-turn-p structure corresponding to amino acid 
residues 23 to 39 of AgfA (Figure 16a). Amino acid residues in SMP were replaced 
with those corresponding to the AgfA sequence and the AgfA structure was built 
manually to residue 52. The resulting 30-residue AgfA template, which no longer 
resembled the SMP structure due to differing loop sizes and longer p-strands, was used 

10 repeatedly with dynamics simulations to build successive levels of the AgfA structure. 
The completed AgfA structure was then minimized using various distance restraints, 
manual changes in sidechain and backbone torsion angles with regular checks on the 
chiral status of the backbone configurations. Distance restraints were gradually 
removed until a low energy structure of 80 kCal was obtained using only Ca restraints. 

15 Unfavorable angles and distances were removed using molecular dynamics (MD) 
simulations of 1 ps with a full set of distance restraints. The optimized structure was 
then subjected to 50 ps of MD simulations with no constraints to further reduce the 
energy of the structure. The distance restraints used were briefly tested by simulated 
annealing. Further improvements to the model were accomplished by generating a 

20 distance and dihedral restraint file using the average Ca to Ca distances observed in the 
respective predicted structure and the x-ray structure used to the model. The 
preliminary dihedral restraint for the angle f and y was set for an extended structure. 
The structure was minimized with and without their respective restraint files. The 
dynamics procedure was used to father improve the f and y angles. 

25 AgfA p-barrel model homology buildup. The p-barrel structure of 

AgfA was built up by homlogy modeling using coordinates of the ten p-strand p-barrel 
of PMP as the inital template. Residues 40 to 86, comprising the first 5 p-strands of 
PMP, were the most homologous with residues 25 to 72 of AgfA (Figure 16b). This 
PMP sequence was converted to the correct AgfA sequence to build the first AgfA 

30 region. This first region was then duplicated and the N-terminal residues L25 to 127 of 
AgfA were overlapped with C-terminal residues 170 to L72 to form the template for the 



38 

second set of 5 p-strands. The amino acids in this second region were then changed to 
correspond to the later half of the actual AgfA sequence. Visual inspection removed 
several obvious side chain contacts before the model was subjected to Steepest descent 
and conjugate gradient minimization. 

5 AgfA B prism model homology buildup. The P prism AgfA structure 

was built based upon alignment of AgfA sequence with VMO-I taking into account 
three core loops and conserved nonpolar-polar-nonpolar motifs at the N-terminal region 
of the AgfA core and conserved VMO-I D and E residues that are N and Q in AgfA 
(Figure 16c). The first of three interal repeats was mutated form VMOI sequence to 

10 AgfA using options in InSightll (Biosym). Occasionally, side chains were moved 
manually when unfavorable interactions were encountered. The model was subjected to 
conjujate gradient minimization with the C a carbons fixed. 

Statistical evaluation of models. The VADAR (Wishart et al 9 1995) and 
PROCHECK (Morris et al 9 1992) statistical analysis programs were used to evaluate 

15 the models at several stages during the buildup procedure as an indication of 
improvement in successive models. Statistics and energy minima of the three prototype 
AgfA fimbrin model structures are presented in Table 4. 

Table 4. Summary of Procheck and VADAR statistics for the AgfA model structures 
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AgfA Model Structure 
parallel p helix P barrel p prism 



Statistic Parameter 



Template protein 3 
25 Free energy (kCal) 
Total energy (kCal) 
f/y core b 

Total accessible surface area (A 2 ) 
Total volume packing (A 3 ) 



SMP 
-64 
-240 
68% 
5524 
12875 



PMP 



-49 



63% 



5848 



12783 



VMO-I 



-40 



56% 



7625 



12500 



% p-strand predicted 



39 

93 



71 40 



a SMP, Serratia marcescens protease (Baumann et aL, 1995); PMP, bovine myelin P2 protein (Cowan et 
al, 1993; Jones et ah, 1988); VMO-I, vitelline membrane outer layer protein I (Shimizu & 
5 Morikawa, 1996; Shimizu et a/., 1994). 

b f/y core calculated using Procheck™ (Morris et al 9 1992) and VADAR (Wishart et aL, 1995). 

RESULTS 

Two-domain character of AgfA fimbrin . Treatment of thin 

10 aggregative fimbriae with at least 70% formic acid was required to ensure at least 
partial depolymerization to release the 17,000 Da AgfA fimbrin (Figure 7). 
Concomitant with the depolymerization of thin aggregative fimbriae was the release of 
a 15,500 Da protein (Figure 7). N-terminal amino acid sequence analysis of the 15,500 
Da species indicated that this minor protein was a truncated form of AgfA missing 22 

15 N-terminal amino acids. This fragmentation of AgfA was probably not due to its 
exposure to concentrated formic acid since prolonged treatment of AgfA with 90% 
formic acid for up to 3 h at 50°C does not increase the amount of the 15,500 Da 
fragment (Collinson et ah, 1991). Moreover, AgfA lacks acid cleavable Asp-Pro bonds 
in the predicted AgfA sequence (Collinson et ah, 1996a). However, the first 17 N- 

20 terminal amino acids of AgfA could be cleaved from the fimbrin by proteinase K 
treatment of intact native or purified, thin aggregative fimbriae (Figure 8, lanes 2 and 4) 
as confirmed by N-terminal amino acid sequence analysis of the truncated proteins. 
However, once depolymerized by pretreatment with formic acid, AgfA was completely 
susceptible to digestion with proteinase K. These results suggested that polymerized 

25 AgfA fimbrins possessed two domains, a short, protease-susceptible N-terminal domain 
of 17 to 22 amino acids and a protease-resistant C-terminal core region comprised of 
109 to 1 14 amino acid residues. 

Two-, five- and ten-fold internal amino acid sequence homology of 
AgfA . The AgfA primary amino acid sequence possessed features consistent with a 

30 two-domain protein. The gly cine-rich, N-terminal 22 amino acid sequence was 
distinguished from the C-terminal core region comprising amino acids 23 to 131 mainly 
on the basis of a striking two-, five- and ten-fold internal sequence homology within the 
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C-terminus (Figure 9). A two-fold homology was evident between the segments C2a 
and C2b that shared 41% amino acid identity or 56% homology if conservative 
substitutions were considered (Figure 9b). 

The five-fold sequence homology within the 109 residue C-terminal core 
5 region was extremely regular and consisted of five, tandemly arranged segments (Figure 
9c). The first 18 amino acids of each of these five segments conformed to the 
consensus sequence Sx 5 QxGx 2 NxAx 3 Q separated by 4 or 5 additional residues except 
for the terminal repeat which ended in Y (Figure 9c). Homology comparisons between 
segments C5a:C5d and C5c:C5e were the highest with 50% identity compared to 

10 segments C5b:C5d or C5d:C5e with 44% residue identity (Figure 9a,c). The other 
segment pairs were 39% identical. 

The ten-fold sequence homology of the AgfA C-terminus was evident as 
paired hydrophobic regions centered at positions 3 to 5 and 14 to 16 within each 
consensus motif (Figure 9d,e). Each hydrophobic region possessed three amino acids 

15 arranged to conserve a nonpolar-polar-nonpolar (ifi) triplet motif (Figure 9e). Together, 
these multiple primary sequence patterns were suggestive of highly regular, repeated 
secondary structure motifs. 

Secondary structure analysis of AgfA . Several secondary structure 
prediction programs were used to analyse AgfA secondary structure as described in the 

20 Materials and Methods. All of the programs predicted several regions of extended 
structure separated by short regions of 4 to 10 residues possessing coil conformation 
(Figure 10). Frequently, the programs predicted that AgfA residues 34 to 41 adopted a 
helical structure where as residues 80 to 87 and 103 to 107 were predicted to be helical 
with less frequency (Figure 10). No potential membrane spanning sequences were 

25 predicted. The ten repeating hydrophobic core regions within AgfA suggested that this 
protein posessed regular, repeating secondary structural units of 3 to 7 amino acids 
connected by loops of 4 to 7 residues. 

The Protein Data Bank and SCOP were extensively searched to identify 
protein structures with multiple extended regions separated by coil regions of 4 to 10 

30 residues suitable for model construction as described in the Materials and Methods. 
Several structures with low homology to AgfA were identified but none of these 
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candidate protein structures contained helical motifs. For this reason, only the extended 
and coil secondary structures predicted for AgfA were used to predict the tertiary 
structure of AgfA. 

Parallel p-helix model of AgfA . The regular pattern of extended 

5 p-strand conformations proposed for AgfA C-terminal region secondary structure was 
used to select coordinates of candidate template proteins of known tertiary structure 
suitable for model construction (see Material and Methods). A p-turn-P motif within 
the p roll structure of the Serratia marcescens metalloprotease (SMP) 3-D crystal 
structure (Baumann et aL, 1995) was judged as the best template on which to build the 

10 inital turn of the AgfA model structure. Successive layers of AgfA tertiary structure 
were then built using standard homology modeling techniques (see Materials and 
Methods). The parallel p helix AgfA tertiary stucture model that emerged from this 
process was a highly regular, compact, right handed, five-coil helix comprised of four 
coils of 22 or 23 amino acids ending with 18 amino acids of the fifth coil followed by a 

15 terminal Y (Figure 11a). This arrangement of vertically stacked, slightly flattened coils 
created two prominent five-strand parallel p-sheet faces (Figure 5a) oriented into a 
slightly wedge-shaped, oval molecule (Figure lib). Each coil consisted of a 
"p-sandwich" in which the nonpolar residues of the two nonpolar-polar-nonpolar (ifi) 
P-strand triplets at positions 3 to 5 and 14 to 16 of each 18-residue repeat were 

20 internalized forming a tight 4-residue turn (xGx 2 ) centered about the conserved G at 
position 9 (Figs. 9e & 11c). The successive p-sandwiches were joined by a less 
constrained turn of 4 or 5 residues (Figs. 9e & 1 lc). 

Consequently, the five, vertically stacked p-sandwiches were aligned 
such that the conserved residues of each consensus sequence were in the same relative 

25 position resulting in vertical stacks of like residues in the internal core of the AgfA 
model structure (Figs. 11c, 12a & 12b). Thus, the AgfA parallel p helix model 
possessed a 19-residue hydrophobic core comprised of nonpolar residues at positions 3, 
5, 14 or 16 of the consensus sequence (Figs. 11 c & 12 a). Flanking one side of this 
non-polar core region were two stacks of five internalized Q and N residues located at 

30 positions 7 and 12, respectively, of each consensus sequence (Figs. 11c & 12a). These 
polar amides, Q29,52,74,97,119 and N34,57,79, 102,1 24, respectively, flanked either 
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side of each successive tight turn such that the C a R amide sidegroups were hydrogen 
bonded to the peptide backbone (Figs. 1 1c & 12a). The other side of the hydrophobic 
core was similarly flanked by two stacks of five polar residues internalized at positions 
1 (S23,46,68,91,113) and 18 (Q40,63,85,108,130) of each successive consensus 

5 sequence (Figs. 1 lc & 12a). 

A second consequence of the helical arrangement of successive p 
sandwiches was the clustering of chemically similar residues on the surface of the 
proposed AgfA model. Hydrophobic patches were present on this AgfA model on the 
frontside (W86, L37, L39, L105) (Figure 12c, green and yellow) and end (Y28, Y30, 

10 Y98, F77, F122, VI 20) (Figure 12d, green) of the molecule. Otherwise, the proposed 
AgfA surface was mainly comprised of polar or charged residues (Figs. 12b,c,d, red). 
The acidic residues E47,71 and D42,60,66,84,92,112 (Figure 12 c,d, red) were predicted 
to be exposed on the end of the molecule opposite to the hydrophobic cluster of Y and F 
residues (Figure 12d, green) such that they out numbered the basic amino acids (Figure 

15 1, blue) 8 to 3. The back face of the AgfA model comprised mainly exposed polar 
amino acids (Figure 12d, red). Several residues with exposed hydroxyl groups were 
present including S26,114; 124,49,51,69,73,94 and Y28,30,98 (Figure 12d, red). This 
compact, highly regular parallel p-helix model of AgfA was consistent with the 
repeating primary amino acid sequence and predicted secondary structure of repeated 

20 p-strand conformations. 

p-barrel and p-prism models of AgfA . In an effort to determine 
whether the AgfA parallel p-helix structure was a reasonable model, the AgfA primary 
sequence was also modeled on two other proteins of known structure, also selected for 
possessing motifs arranged in a series of extended p-strands. 

25 The AgfA p-barrel model (Figure 13a) was built using the coordinates 

of the ten-stranded p-barrel structure of bovine myelin P2 protein (PMP) 3-D crystal 
structure (Cowan et a/., 1993; Jones et a/., 1988) as the template (Figure 13b) as 
described in the Material and Methods. In this AgfA prototype model, each of the five 
consensus repeat motifs were folded about the central QxGx 2 N sequence such that the 

30 predicted p-strands were bonded in antiparallel fashion (Figure 13a). Segments C5a, 
C5b and the C-terminal half of C5e formed one antiparallel p-sheet face of the barrel 
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where as the other face was formed by C5c, C5d and the N-terminal half of C5e (Figs. 
9a & 13a). Several key features of the AgfA (5-barrel model included: a hydrophobic 
core region comprised mainly of internalized nonpolar residues of the ten nonpolar- 
polar-nonpolar (ifi) triplets; surface exposure of the aromatic residues Y28,30,55 and 

5 F77, 122 at the same end of the molecule; basic residues R44 and K45,89 clustered at 
one end of the molecule on the protruding loop between segments C5a and C5b and a 
general lack of clustering of acidic and hydroxyl-containing residues. 

The AgfA p-prism model (Fig 14a) was based on coordinates from the 
X-ray crystal structure of vitelline membrane outer protein, VMO-I (Shimizu & 

10 Morikawa, 1996; Shimizu et aL 9 1994) (Figure 14b). This AgfA model predicted a 
fairly compact, elongated structure with several short p-strand regions of 3 to 6 
residues joined by loops of random coil (Figure 14a). The (3-prism model did not 
reflect the five-repeat symmetry of the primary sequence but rather an organization of 
three repeats based upon the first, third and fifth repeat. The nonpolar residues were 

15 internalized to generate a hydrophobic core but the conserved N and Q residues of the 
five consensus repeats were generally surface exposed. The acidic residues and many 
of the hydroxyl-containing S and T residues were clustered towards the same end of the 
molecule where as basic residues were featured at both ends of the molecule. The F and 
Y residues were surface exposed but no extensive surface hydrophobic patches were 

20 predicted. 

AgfA Model Quality. Each of the three model stuctures obtained for 
AgfA by homology modeling represented a reasonable, preliminary protein structure 
based on the VADAR statistical analyses performed (Table 4). Further refinements 
were not attempted at this stage since physical data on AgfA structure are not yet 

25 available to corroborate a given model. Each model predicted a series of (3-strands 
separated by turns with the predicted surface-exposed residues in general agreement 
with proposed buried and surface exposed residues in each of the tertiary structure 
models (Figs. 12, 13 & 14). However, the p-prism model deviated somewhat from the 
predicted surface exposed residues of AgfA in this respect. 

30 Evolutionary conservation of Salmonella AgfA primary amino acid 

consensus sequence . Previous alignment of the AgfA fimbrin subunit of Salmonella 
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thin aggregative fimbriae with that of E. coli curli fimbriae indicated that these two 
proteins, including the signal sequence, were 74% identical and 86% similar (Collinson 
et al, 1996a). A more detailed analysis of the alignments of these two related fimbrins 
was performed to determine the pattern and extent of conservative versus non- 
conservative amino acid substitutions (Figure 15, Table 5). The 22-residue N-terminal 
segment of AgfA was 82% (18/22) identical, but 91% (20/22) similar to CsgA if 
conserved residues were considered (Table 5). The 109 amino acids of the C-terminal 
domain were 71% (77/109) identical and 80% similar if ten additional conservative 
residue replacements were considered. 

Table 5. Amino acid conservation between Salmonella AgfA and E. coli CsgA. 



Consensus 

sequence 

position 3 



Protein 



Identity 



Percent 
(number of residues/total) 
Conservative Different 



15 Inside (•): 

(1,3,5,7,12,14,16,18) 

Outside (o): 

(2,4,6,13,15,17) 

Turns (t): 
20 (8-11,19-23) 



AgfA/CsgA 92.5 (37/40) 



5.0(2/40) 2.5(1/40) 



AgfA/CsgA 50.0 (15/30) 20.0 (6/30) 30.0 (9/30) 



AgfA/CsgA 64.1 (25/39) 5.1 (2/39) 30.8 (12/39) 



a The position of amino acids in each consensus sequence are labeled (•, o, t) according to parallel (3 

helix model as in Figure 9. 
b Each residue of the AgfA fimbrin was compared to the respective amino acid in CsgA and assessed 
25 whether the pairs were identical, similar or different and scored according to the following groups; 

hydroxyl (T, S); acidic (E, D); basic (R, K, H); amide (Q, N); aromatic (Y, F, W); aliphatic (A, M, I, 

L, Y) or glycine (G). 



The distribution of identical and conserved residues compared to that of 
30 non-conserved residues within the repeated segments of the two fimbrins was very 
distinctive. In general, residues predicted to be internalized in the parallel p-helix 
model of AgfA were more highly conserved between the two fimbrin proteins than 
residues predicted to be on the p-sheet surfaces or at the predicted turn positions 
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(Figure 15, Table 5). Residues at positions 1,3,5,7 and 12,14,16,18 of each repeat motif 
were 92.5% identical and 5.0% conserved (Figure 15, Table 5). However, residues 
occupying the alternate positions in the consensus sequence at predicted surface- 
exposed positions 2,4,6 and 13,14,15 of each repeat motif were only 50% (15/30) 
5 identical with an additional 20% (6/30) of the residues conserved for a 70% overall 
similarity (Figure 15, Table 5). Similarly, the two regions comprising expected turns at 
postions 8,9,10,11 and 19,20,21,22,23 had an overall similarity of 69.2% (Figure 15, 
Table 5). 

10 DISCUSSION 

Three hypothetical, all-p class tertiary structure models were assembled 
for AgfA, the fimbrin subunit protein of Salmonella thin aggregative fimbriae. These 
models represent the first predictions of a bacterial fimbrin or pilin tertiary protein 
structure using homology buildup and molecular modeling techniques. The primary 
15 amino acid sequence repeat motif comprising the protease-resistant AgfA C-terminal 
core facilitated this novel approach to explore the tertiary structure of an aggregative 
structural protein. 

One feature of the AgfA fimbrin of Salmonella thin aggregative fimbriae 
is the highly conserved, five-fold repeated consensus sequence, 

20 SxifixQxGx 2 NxAfixQ(x)x 3 comprising 109 residues of the protease resistant C-terminal 
core region of the fimbrin. This region is preceded by a distinctive, protease- 
susceptible glycine-rich N-terminus of 17 to 22 residues. Except for the AgfA fimbrin 
homologue of E. coli curli, CsgA (Hammar et ah, 1996), no other fimbrial proteins 
possess highly conserved, tandem primary amino acid sequence repeat motifs. 

25 Analysis of other fimbrins and pilins using common predictive 

algorithms for secondary structure indicate that like AgfA, these structural proteins have 
p-strand conformations distributed throughout the subunit protein (Girardeau et ah, 
1991; Klemm, 1984; Lintermans et ah, 1988). An analysis of 24 various bacterial 
fimbrins and pilins by the method of Gamier et ah (1978) similarly indicated 31% to 

30 69% predicted p-strand content often with multiple p-strand motifs present throughout 
the protein (Collinson et ah, unpublished data). The presence of amphipathic p-strands 
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within various Enterobacteriaceae fimbrial components also has been recognized by 
hydrophobic cluster analysis (HCA) (Girardeau & Bertin, 1995; Mechin et al, 1995). 
Using HCA, fimbrial protein components of thick (7 nm) fimbriae that possess a 
cysteine loop are distinct structurally from those comprising thin fimbriae and lacking a 

5 cysteine loop (Girardeau & Bertin, 1995; Mechin et a/., 1995). However, the AgfA C- 
terminal core is predicted to adopt a very unusual, regular repeating secondary structural 
pattern with most of the ten nonpolar-polar-nonpolar triplet motifs adopting extended 
secondary structure separated by short regions of coil structure. Salmonella AgfA, and 
the E. coli AgfA homologue, CsgA (Hammar et al., 1996), may be an extreme case in 

10 which the obvious, highly conserved amino acid residues in the primary sequence 
repeat motifs reflect a regular predicted secondary structure pattern. 

Several features of the AgfA primary sequence and predicted secondary 
structure suggest that AgfA probably adopts a tertiary structure distinct from the Figure 
tx-p roll described for the gonococcal pilin of Neisseria type IV pili. Notably, AgfA 

15 lacks the 55 amino acid hydrophobic, a-helical N-terminal region proposed to occupy 
an internal position in intact pili (Paranchych & Frost, 1988; Parge et a/., 1995). The 
four central anti-parallel P strands of the gonococcus pilin are flanked between the C- 
terminal portion of the 55-residue N-terminal a-helix and the externally exposed 
disaccharide-containing loop and disulfide region (Parge et al, 1995). AgfA lacks 

20 extensive regions of predicted a-helical structure and does not contain cysteine residues 
(Collinson et al., 1996a; Collinson et al., 1991). Post translational modification of 
AgfA has not been demonstrated but two putative NAT O-glycosylation sites exist at 
residues 80 to 82 and 115 to 117 analagous to the N(T/S)S O-glycosylation site of 
gonoccal and menigococcal pilins (Marceau et al, 1998). Predictably, the actual S. 

25 enteritidis AgfA fimbrin tertiary structure likely differs significantly from the a-p roll 
of gonnococcal pilin. 

The parallel p helix model of AgfA is the most alluring tertiary structure 
emerging from this study for several reasons. Firstly, this model predicts a very regular, 
compact structure congruent with the conserved, primary sequence repeats found in the 

30 C-terminus of AgfA. In addition, the proposed molecular mechanisms stabilizing the 
AgfA parallel p helix model structure are reminiscent of those of the di-strand P roll 



47 



motif of the S. marcescens metallo-protease on which this AgfA model was based 
(Baumann, 1994) and the stucturally related tri-strand parallel (3 helix motif of the 
pectate lyases (Lietzke et al, 1994; Pickersgill et al, 1994; Yoder & Jurnak, 1995) and 
Salmonella P22 phage tail spike protein (Steinbacher et al 9 1994; Yoder & Jurnak, 

5 1995) with potentially important differences. The predicted parallel p helix AgfA 
structure would be expected to be more stable than the above helical structures. This is 
because the parallel p helix model of AgfA predicts that four residues in each of the five 
coils would be internalized forming a larger hydrophobic core compared to the two 
residues normally seen in the previously described p roll and parallel p-helix motifs 

10 (Yoder & Jurnak, 1995). The intriguing arrangement of two internalized stacks of five 
Q and five N residues at postions 7 and 12, respectively, that flank the relatively sharp 
turn centered at the conserved G residue at position 9 of each coil could apparently 
stabilize the turns by extensive H-bonding with the peptide chain backbone (Figure 
1 lc). Similarly, two internalized stacks of five S and five Q residues at positions 1 and 

15 18 flank the turns joining each coil. The pectate lysases, which demonstrate polar 
stacks of up to 6 N residues and 3 S residues merely possess a single polar stack on one 
side of each turn (Yoder & Jurnak, 1995) . Similarly, ribonuclease inhibitor, a protein 
containing leucine-rich repeats folded in a helix of alternating strand and a helical 
regions possesses an N ladder at one end of the strand motif _ ADDIN ENRef (Kobe 

20 & Deisenhofer, 1993)_. Conversly, the barrel and prism AgfA model structures 
apparently lack certain molecular interactions important for the stability of respective 
template proteins. The PMP barrel structure possesses many hydrophilic residues that 
project into the interior of the barrel and form an extensive network of hydrogen bonds 
and salt bridges with a non-central hydrophobic pocket within the barrel structure that 

25 interacts with lipids _ ADDIN ENRef JCowan et al., 1993; Jones et al., 1988)_. 
Furthermore, the AgfA barrel model is less extensively hydrogen bonded than its PMP 
template. In addition, AgfA lacks cysteine residues, eight of which apparently stabilize 
the prism structure of VMO-I and enhance heat stability by forming four internal S-S 
bridges _ ADDIN ENRef (Shimizu & Morikawa, 1996)_. 

30 Thirdly, the conservation of residues in the AgfA and CsgA fimbrin 

homologues is a strong evolutionary argument in support of the AgfA parallel helix 
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model. The hypothetical contribution of the internal, conserved polar and nonpolar 
residues to the stability of this model is further strengthened by the fact that 37 of these 
40 residues are identical in the AgfA homologue, CsgA. The residues forming the 
stacks of internalized polar residues are perfectly conserved and there is only one non- 

5 conservative replacement of L for T in AgfA within the predicted hydrophobic core 
(Figure 15, Table 5). In this context, it is more difficult to envision the necessity for 
conserving these residues in the proposed barrel model of AgfA which predicts internal 
positions of these polar residues but without an obvious role in stability given the less 
extensive participation in main chain hydrogen bonding. Similary, structural integrity 

10 and stability of the prism model would not conceivably require the conservation of N 
and Q residues which are predicted to be mainly surface exposed in this model. 
Conservation of residues is a strong argument for eliminating barrel and the prism 
models. If AgfA forms a parallel helix structure, this would be the first fimbrin 
recognized to do so. 

15 Parallel sheet structures, like those described for the proposed AgfA 

parallel helix model are often associated with protein-protein interactions. Although 
metalloproteases _ ADDIN ENRef (Baumann et aL 9 1995; Baumann et al, 1993) and 
pectate lyases (Yoder & Jurnak, 1995) form coiled fold motifs that do not exist in 
multimeric form, the P22 tailspike protein forms a native trimer that is resistant to 

20 proteases, thermostable beyond 80°C and is not dissociated by exposure to SDS 
(Steinbacher et aL, 1994). The stability of the P22 tailspike trimer is attributed to the 
interaction of the hydrophilic p sheet surfaces (Steinbacher et ah, 1994). The 
possibility that the unusual stability of polymerized AgfA and the aggregative nature of 
the fimbriae is due in part to electrostatic interactions between monomers is consistent 

25 with the observed resistance of thin aggregative fimbriae to depolymerization by 
treatment with SDS (Collinson et al. 9 1991), octylglucoside or Tween-20 (Collinson, 
unpublished data). Salt bridges as well as hydrophobic interactions contribute to the 
maintenance of trimeric forms of carbonic anhydrase, the monomers of which assume a 
left hand parallel p helix structure (Kisker et aL, 1996). Very little is known concerning 

30 the molecular basis for fimbrin polymerization and stability. Forces important in the 
Neisseria Type IV model implicate a combination of a hydrophobic N-terminus in the 
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central core of the fiber and electrostatic forces stabilizing subunit-subunit interactions 
in the globular head (Parge et a/., 1995). Fimbrins and pilins are not covalently linked 
but assemble into very stable structures resistant to proteases and chemical denaturants 
such as chaotropes and alkali (Collinson et aL 9 1991; Eshdat et aL, 1981; Ho et ah, 

5 1990; Korhonen et ah, 1980; Parge et al. 9 1995). The distribution of those amino acid 
residues predicted to comprise the external face of the amphipathic P-strands in the 
parallel (3 helix AgfA model are primarily polar with hydrophobic, acidic and basic 
"patches" consistent with a requirement for interactive surfaces to stabilize polymers of 
AgfA. The glycine rich N-terminal region of AgfA, while susceptible to protease 

10 digestion, also posses a sequence consistent with two glycine loops as defined by 
Steinert et al (1991) which have been hypothesized to form interactions that stabilize 
higher stuctures. The parallel p helix model of AgfA offers an interesting solution to a 
multifunctional fimbrin molecular surface that must be able to have regions for subunit- 
subunit interaction, solvent exposed areas, interaction with minor subunits and fiber- 

1 5 fiber interactions common to many fimbriae. 

The three all-P class tertiary structure models of AgfA enable rational 
experimental design to facilitate investigations into Salmonella thin aggregative 
fimbriae structure and biogenesis. The quest to obtain soluble AgfA monomers for 
future physical analysis or structural studies may be aided by considering the 

20 information offered by the hypothetical AgfA model structures, especially that of the 
AgfA parallel p helix model. Optimal conditions for depolymerization of thin 
aggregative fimbriae in conditions less severe than 90% formic acid or prevention of 
AgfA polymerization by introducing minor tertiary structural perturbations by site 
specific mutagenesis may be ways to obtain soluble fimbrin monomers. Selective 

25 mutagenesis of AgfA to identify structurally and functionally important regions of this 
fimbrin is currently in progress (White et al, 1999). Site specific mutagenesis to alter 
key amino acid residues conserved in the AgfA primary sequence should provide 
important information on fimbrin structure and stability and the importance of each of 
the five C-terminal repeat segments in AgfA. Site specific mutagenesis of the K99 

30 fimbrin penultimate Y prevents fiber formation (Simons et al, 1990). The role of the 
terminal Y of AgfA is not known, however, this residue is positioned 3 and 5 residues 



50 

C-terminally to the two hydrophobic residues of the tenth p-strand and 9 residues from 
F122, an arrangement reminiscent of the less conserved |3-zipper motif of fimbrins 
assembled by the FGL chaperones (Hung et ah, 1996). The possibility of chaperone 
involvement in AgfA assembly has not been ruled out. Thus, the AgfA structural 
5 models presented herein provide a theoretical basis from which to further our 
knowledge of the structure and biogenesis of the stable, thin aggregative fimbriae of 
Salmonella. 
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20 Example 2 

High efficiency gene replacement in Salmonella enteritidis: Chimeric fimbrins 
containing a t cell epitope from leishmania major. 
The present invention provides for a simple, highly efficient method for 
gene replacement within the chromosome of Salmonella. This method is advantageous 

25 because it allows for gene replacement at native sites in the chromosome without the 
need for specific recombination-proficient bacterial strains or the use of selectable 
markers within the recombinant genes. Thus, the incorporation of non-target DNA in 
the chromosome is avoided. To illustrate the utility of this method, sefi and agfA, 
encoding the fimbrin subunit proteins of SEFI 4 [8] and thin aggregative fimbriae 

30 (SEF17) [11], respectively, were engineered to contain the DNA sequence encoding an 
immunoprotective T cell epitope from the Leishmania major surface protein, GP63 
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[25] , and were replaced into the chromosome of S. enteritidis. To our knowledge, this 
study presents the first chimeric Salmonella fimbrin genes and reports the first example 
of chimeric fimbrin genes reconstituted into the chromosome of an otherwise wild-type 
organism. 

5 

Materials and Methods 

Bacterial strains, media, and growth conditions . S. enteritidis 27655 
strain 3b has been previously described [16]. E. coli XL-1 Blue (Stratagene) was used 
as the host for pTZ18R, pHSG415, pGEM-T, pGPl-2 and their derivatives (Table 6). 

10 

Table 6. Plasmids used in this study. 



Plasmid 


Description and relevant genotypes 


Reference/Source 


pTZ18R 


Standard cloning vector with T7 promoter; Ap R 


Pharmacia 
Biotech Inc. 


P HSG415 


Temperature-sensitive pSClOl ori; Cm R , Km R , Ap R 


[20] 


pGEM-T 


A-T cloning vector; Ap R 


Stratagene 


pGPl-2 


T7 RNA polymerase cloned under the control of lp L 
andcI857,P15A ori; Km R 


[44] 


pTZ18pil 


pTZ18R, sefABC 


[8] 


pHAG 


pUC 1 8, agfBAC 


[9] 


pTZSef 


pTZ18R, sefA 


This study 


pTZSPIO 


pTZ18R, sefA::PT3 


This study 


pTZAgf 


pTZ18R, agfA 


This study 


pTZAP7 


pTZ\8R, agfA: :PT3 


This study 


pHSSPIO 


pHSG415, sefA::PT3 


This study 


pHSAP7 


pHSG415, agfA: :PT3 


This study 



E. coli XL-1 Blue harboring recombinant plasmids was grown in Luria- 
15 Bertani (LB) broth [39] or Terrific broth (TFB) [40] for 20-24 h at 28°C, 37°C or 42°C 
as specified below. Media were supplemented with ampicillin (Ap, 100 mg ml" 1 ), 
kanamycin (Km, 50 mg ml" 1 ), 5-bromo-4-chloro-3-indolyl-(3-D-glactopyranoside (X- 
gal, 40 mg ml" 1 ), or isopropyl-p-D-thiogalacto-pyranoside (IPTG, 1 mM) as required. 



60 



SefA or AgfA fimbrins were analyzed from S. enteritidis cells grown in 
CFA broth [15] or T broth [11] statically at 37°C for 48 h or incubated on T medium 
(T) and T medium Congo red indicator plates (TCR) as previously described [10]. 

Recombinant DNA Techniques and Sequencing . Electroporation of & 
5 enteritidis and E. coli using purified plasmids (QIAprep spin kit, Qiagen) was 
performed using standard techniques (Gene pulser electroprotocol, BioRad). 
Recombinant plasmids were routinely purified using standard alkaline lysis plasmid 
preps [40]. Restriction enzyme digestions (New England Biolabs) and ligation 
reactions (Gibco-BRL, Stratagene) were performed as described by the manufacturers. 
10 DNA fragments or PCR products were separated by agarose gel electrophoresis and 
visualized by staining with ethidium bromide [40]. DNA fragments purified from 
agarose gels using the Sephaglas bandprep kit (Pharmacia) or QIAquick gel extraction 
kit (Qiagen) were used for subcloning and sequencing. All PCR and sequencing 
primers were synthesized using an Applied Biosystems Model 391 PCR-MATE EP 
1 5 DNA synthesizer (Applied Biosystems Inc.). 

Polymerase Chain Reaction . The two step overlap extension PCR 
reaction [22] to generate sefA::PT3 or agfA::PT3 required four primers, A, B, C, and D 
(Figure 17). Primers B and C were the internal primers containing 48 nucleotides 
encoding the PT3 epitope followed by 30 nucleotides of sefA or agfA sequence at the 
20 desired site of insertion, and A and D were the two external primers containing EcoRl 
or Hindlll restriction sites (Table 7). In the first PCR step, two separate 100 ml PCR 
reactions were performed using primers A/B and C/D, respectively (Figure 17). Each 
amplification reaction contained 20 ng of plasmid template DNA, 50 pmol of each 
primer, 0.4 mM of each deoxynucleotide triphosphate (Boehringer Mannheim) and 2.5 
25 U Pfu DNA polymerase (Stratagene) in buffer supplied by the manufacturer. The 
second PCR step required 50 ng each of the two purified PCR products from the first 
PCR step as template DNA with 50 pmol of each external primer. After an initial 4 min 
denaturation step at 94°C, the Pfu enzyme was added and thermocycling was performed 
in a PTC-100TM Programmable Thermal Controller (MJ Research Inc.) with 30 cycles 
30 of denaturation (94°C, 45 s), annealing (55°C, 45 s), elongation (72°C, 1 min), followed 
by a 10 min elongation at 72°C. Wild-type sefA or agfA genes were amplified using the 
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external primers (A and D) only under the same conditions as the first PCR step 
described above. 



Table 7. PCR primers used to generate sefA::PT3 or agfA::PT3' ( 

5 



Primers b 


Length 


Sequence 0 (5'-3') 


14-A 


39 


TTGGAATTCTTCTTAAATTTTTAAAATGGCGTTGAGTAT 


14-B 


78 


AGCATGAGCCA 111 CA 1 u 1 G 1 AAUAACACvj lul AALuAUt i urt. i ^ a 
TATGCAATAGTAAC CGCTGCCTGAACCACTGC 


14-C 


78 


TATGATCAGCTCGTTACACGTGTTGTTACACATGAAATGGCTCATGC 
TGGGCCTGCTGTT GCTGCTGGTCAGAAAGTT 


14-D 


39 


ATTAAGCTTATACATAATCCCTCTTTAAGTTTTTGCATG 


17-A 


39 


GCAGAATTCAGCAGTTGTAGTGCAGAAACAGTCGCATAT 


17-B 


78 


TGCATGTGCCATTTCATGGGTAACAACACGGGTAACCAGCTGATCA 
TAGTTTTTAGCGTT CCACTGGTCGATGGTGGC 


17-C 


78 


TATGATCAGCTGGTTACCCGTGTTGTTACCCATGAAATGGCACATGC 
AAATCAGACCGCA TCTGATTCCAGCGTAATG 


17-D 


39 


AGACGCAAGCTTCGTTTAATGTGACCTGAGGGATCACCG 



3 Primers used to generate sefA::PT3 or agfA::PT3 as noted in Fig 17. 

b Primers prefixed with 14- or 17- used to generate sefA (SEF14) or agfA (SEF17) recombinants, 
respectively. 

1 0 c Underlined sequence corresponds to the 48 bp PT3 DNA sequence; bold letters correspond to EcoRI 
(GAATTC) or Hindlll (AAGCTT) restriction endonuclease sites. 



To amplify regions surrounding sefA and agfA in Ap s S. enteritidis 
strains after gene replacement, primers INI (5'-GGG ATG TTG TGT AAA GAT AAA 

15 AAA ATA GTG-3') and IN2 (5'-TGC CCA ATC TTA GGC CAT AAT ATT TTT 
GTG-3') or TAF59 (5'-AGG AAG GAT CAA AAC TAT TGT CCG TTA TTT GAG- 
S') and TAF60 (5'-TAT ATT TAC ACT AAG ACG AGA CAA CTC AAT CGG-3') 
were used ? respectively. To obtain template DNA for each S. enteritidis strain 
sequenced, cells from a 1 mL overnight LB culture were harvested and boiled for 10 

20 min. in 1 mL dH 2 0. 100 ml PCR reactions contained 20 ml of the boiled whole cell 
supernatant, 50 pmol of each primer, 0.2 mM of each deoxynucleotide triphosphate and 
4 U of Taq DNA polymerase (Boehringer Mannheim) in buffer supplied by the 
manufacturer. After an initial 5 min denaturation step at 95°C, Taq enzyme was added 
and thermocycling was performed as above with 30 cycles of denaturation (94°C, 1 
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min), annealing (55°C or 60°C, 1 min), elongation (72°C 5 1 min), followed by a 10 min 
elongation at 72°C. 

Gene replacement procedure in S. enter itidis 3b « S. enteritidis 3b 
containing pHSSPIO or pHSAP7 was grown for 24 h in 5ml TFB/Ap (42°C, 250 rpm). 
5 5 ml of this culture was used to inoculate 5 ml TFB/Ap and cells were grown at 42°C 
for 24 h; this transfer step was repeated four times. To select Ap R colonies, dilutions of 
the final 42°C culture were plated on LB/Ap plates and grown overnight at 42°C. 
pHSSPIO and pHSAP7 cointegrate colonies were grown individually in 5 ml TFB 
(28°C, 250 rpm, 24 h). 5 ml of each culture was used to inoculate 5 ml TFB and cells 
10 were grown at 28°C for 24 h; this transfer step was repeated four times. Serial dilutions 
from each of the final 28°C cultures were plated on LB medium and incubated at 28°C 
for 24 h. To select Ap s colonies, isolated colonies were picked from LB and replica- 
plated onto LB/Ap plates. 

DNA sequence analysis . Recombinant sefA::PT3 and agfA::PT3 
15 generated by PCR were cloned into pTZl 8R for sequence analysis. For analysis of Ap s 
S. enteritidis strains after gene replacement, PCR-generated products from the 
chromosome containing sefA or agfA and surrounding region were ligated into the A-T 
cloning vector, pGEM-T. To ensure the correct DNA sequence was obtained and to 
resolve errors due to Taq DNA polymerase, three individual pGEM-T isolates from 
20 each & enteritidis strain were sequenced. Plasmid DNA used for sequence analysis was 
prepared using the QIAprep spin miniprep kit (Qiagen, CA). Sequencing primers used 
were the Ml 3 universal forward and reverse primers. Sequencing was carried out using 
an Applied Biosystems Model 377 DNA Sequencing System and the PRISM Big Dye 
Primer or Big Dye Terminator Cycle Sequencing Ready Reaction Kit (Perkin-Elmer). 
25 Sequence data was analyzed using DNASTAR Lasergene software (DNASTAR, Inc.). 

Expression of chimeric fimbrin genes in E. coli. Expression of 
sefA::PT3 and agfA::PT3 in E. coli using T7 RNA polymerase was performed 
according to the method of Tabor and Richardson [44]. E. coli XL-1 Blue was co- 
electroporated with purified plasmids pTZAP7, pTZAgf, pTZSPIO, pTZSef or pTZ18R 
30 (control) along with pGPl-2. Cultures were grown in TFB/ApKm for 18 h (28°C, 200 
rpm), one volume of pre-warmed media was added and the cultures were grown at 42°C 
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for 3 h to induce production of the cloned gene products. Cells were harvested at 4,000 
rpm, normalized to 1.0 A 600 , and processed as reported for S. enteritidis 3b [10] before 
SDS-PAGE and Western blotting. 

Preparation of immune serum . Purified synthetic PT3 peptide was 
5 coupled to soluble keyhole limpet hemocyanin (KLH; Sigma) using glutaraldehyde 
[19]. Purified KLH-PT3 (330 mg) was resuspended in dH 2 0 and emulsified in 
complete Freund's adjuvant prior to subcutaneous and intramuscular injections of a 
female New Zealand White rabbit. Two booster doses of KLH-PT3 (330 mg) 
emulsified in incomplete Freund's adjuvant were given at 4 week intervals. Titers of 
10 the immune serum were determined by enzyme-linked immunosorbent assay (ELISA) 
against the PT3 peptide. The rabbit was exsanguinated 4 weeks following the final 
booster injection. This serum was passaged over an immunoaffmity column which was 
prepared by coupling PT3 to CNBr-activated Sepharose 4B (Pharmacia) following 
procedures outlined by the manufacturer. The eluate from this column was pooled, 
15 tested for reactivity to PT3 by ELISA, and concentrated 10-fold using an ultrafiltration 
cell (Amicon model #8010) with a YM10 membrane (Amicon). This concentrated 
antibody solution was used as the final PT3-specific immune serum. Immune sera to 
whole SEF14 and SEF17 fimbriae have been previously described [8,1 1]. 

SDS-PAGE and Western blot analysis . For analysis of S. enteritidis 
20 strains after sefA::PT3 gene replacement, cells from 1ml of 1 A 600 culture were 
harvested, resuspended in 200 ml of SDS-PAGE sample buffer supplemented with 0.2 
M glycine (pH 2), and boiled for 10 min. This cell extract was clarified by spinning in a 
microcentrifuge (13,200 rpm, 5 min.) and used directly for SDS-PAGE analysis [7]. 
For analysis of S. enteritidis strains after agfA::PT3 gene replacement, cells were 
25 scraped from TCR plates using a glass slide and resuspended in 1 ml Tris buffer (1 OmM 
Tris, pH 7.5). Aliquots of this cell slurry were mixed with an equal volume of 2 x SDS- 
PAGE sample buffer supplemented with 0.2 M glycine (pH 2) and boiled for 10 min. 
The cell extract was clarified as above and used directly for SDS-PAGE. The insoluble 
glycine extracted cell material was washed 3 x in 0.5 ml dH 2 0 and treated with 90% 
30 formic acid before SDS-PAGE analysis as previously described [11]. For analysis of 
acetone-precipitated culture supernatant proteins, 1ml of T broth culture was clarified as 
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above and 250 ml of culture supernatant was aliquoted and mixed with 1 ml of ice-cold 
acetone. The precipitated proteins were sedimented (13,200 rpm, 20 min, 4°C), the 
supernatant was discarded and the protein pellet was dried for 20 min under vacuum. 
The pellet was resuspended and boiled for 10 min in lx SDS-PAGE sample buffer 

5 supplemented with 0.2 M glycine (pH 2) before loading onto SDS-PAGE. SDS-PAGE 
was carried out according to the method of Laemmli [27] with a 5% stacking gel and 
12% resolving gel. Proteins separated by SDS-PAGE were electrophoretically 
transferred to nitrocellulose using an LKB Multiphor II Electrophoresis System 
(Pharmacia Biotech) and were detected by Western blot techniques using fimbriae- or 

10 PT3-specific antisera and goat-anti-rabbit immunoglobulin G-alkaline phosphatase 
conjugates (Cedarlane) as described previously [1 1]. 

Results 

Generation of chimeric S. enteritidis fimbrin genes The S. enteritidis 3b 

15 fimbrin genes, sefA and agfA were chosen as target DNA sequences to receive a site- 
specific epitope replacement. The two-step overlap extension PCR method of Horton et 
al. [22] was used (Figure 17A, B) to replace 48 bp of the fimbrin gene with 48 bp 
encoding the PT3 T cell epitope from L. major (Figure 17C). The PCR-generated genes 
encoded chimeric SefA and AgfA proteins carrying the PT3 epitope within specific 

20 regions of the mature protein (residues 18-33 in SefA and residues 111-126 in AgfA). 
The resultant sefA::PT3 and agfA::PT3 PCR products were sequenced to confirm the 
fidelity of PCR and were cloned into pHSG415 (Figure 18 A) to generate recombinant 
temperature-sensitive plasmids pHSSPIO and pHSAP7, respectively (Table 6). 

Gene Replacement in S. enteritidis. The recombinant temperature- 

25 sensitive pSCl 01 -derivative plasmids, pHSSPIO and pHSAP7, were used to replace the 
native genes in S. enteritidis 3b (Figure 18B). Cells were transformed with pHSSPIO or 
pHSAP7 and directly plated on selective media (LB/Ap) at the non-permissive 
temperature (42°C), but virtually no plasmid cointegrates were obtained. Therefore, 
cells were grown 24h in liquid media (TFB/Ap) and transferred five times into fresh 

30 media for a total growth period of 120 h before plating on LB/Ap. Of the 50 isolated 
colonies picked for both pHSSPIO and pHSAP7, 100% of colonies were identified by 
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PCR to contain an integrated plasmid. Three pHSSPIO cointegrate colonies and four 
pHSAP7 cointegrate colonies were then selected and grown at the permissive 
temperature of 28°C in liquid media (TFB) without Ap selection pressure for 24 h and 
tranferred three times into fresh media for a total growth period of 72 h. Dilution 

5 plating of these final colonies resulted in the same number of CFU on both LB and 
LB/Ap plates, suggesting that the majority of cells were still Ap R (42°C). However, 
after an additional growth period of 48 h (120 h total) at 28°C, a high percentage of Ap s 
colonies was obtained. 184 isolated colonies were picked from each cointegrate culture, 
yielding 65%, 55%, and 5% Ap s from the three pHSSPIO cointegrates, and 27%, 15%, 

10 10% and 9% Ap s from the four pHSAP7 cointegrates (Table 8). pHSSPIO and 
pHSAP7 cointegrate colonies were also grown at the non-permissive temperature 
(42°C) without Ap selection pressure (120 h). In contrast to the cultures grown at 
28°C, no Ap s colonies were obtained. 



15 Table 8. Efficiency of sefA::PT3 and agfA::PT3 gene replacement in S. enteritidis. 



Plasmid cointegrate colonies 


Ap s colonies 3 # 


Chimeric gene % 


replacements 

(%) 


pHSSPIO 








Isolate #1 


119 


65 


33 b 


Isolate #2 


8 


4 


2 b 


Isolate #19 


101 


55 


23 b 


pHSAP7 








Isolate #2 


18 


10 


T 


Isolate #3 


28 


15 


10 c 


Isolate #9 


49 


27 


4 C 


Isolate #10 


17 


9 


8 C 



a 184 individual colonies picked in total. 

b Frequency of sefA::PT3 containing S. enteritidis estimated by preliminary PCR results. 
20 c Frequency of agfA::PT3 containing S. enteritidis estimated by CR binding morphology (TCR). 



Genetic analysis of S. enteritidis strains containing chimeric sefA::PT3 
or azfA::PT3 fimbrin genes . To identify S. enteritidis strains containing the chimeric 
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sefA::PT3 gene, twenty-eight of the final Ap s colonies were chosen and analyzed by 
PCR. Sixteen colonies were identified as having the PT3 DNA sequence within sefA 
and all twenty-eight colonies were negative for the pSClOl ori sequence found in 
pHSG415. Furthermore, PCR products were the same size as PCR products of sefA 

5 from S. enteritidis 3b indicating that no extra plasmid sequence was inserted in this 
region. Four S. enteritidis strains were further analyzed by Southern blotting using 
DNA probes specific for the PT3 DNA sequence, the region within sefA replaced by the 
PT3 DNA sequence, the region in sefA 3' to the PT3 insertion site, or the entire £coRI 
and Hindlll digested pHSG415 plasmid sequence. These data confirmed the PCR 

10 results; two of the strains contained sefA::PT3, two strains contained wild-type sefA, 
and all four strains did not contain any pHSG415 plasmid sequence. Final confirmation 
of the genotype of these four strains was obtained by DNA sequencing (see Materials 
and Methods), proving that all four strains contained the correct, predicted gene 
sequences. 

15 To identify S. enteritidis strains containing the chimeric agfA::PT3 gene, 

five of the final Ap s colonies were chosen and were analyzed by PCR. Three strains 
were identified as having the PT3 DNA sequence within agfA and all five strains were 
negative for the pSClOl ori sequence as above. PCR products from each strain were 
the same size as PCR products of agfA from S. enteritidis, indicating that there was no 
20 additional plasmid sequence inserted in this region. Final confirmation of the genotypes 
of these strains was obtained by DNA sequencing proving that three strains contained 
agfA::PT3 and two strains contained wild-type agfA. Thus, these data confirmed the 
creation of S. enteritidis strains containing chimeric sefA::PT3 or agfA::PT3 fimbrin 
genes by precise recombination events. 

25 Plasmid-based expression of sefA::PT3 and aefA::PT3 in E. coll To 

ensure that the chimeric genes used for gene replacement in S. enteritidis could be 
readily transcribed and translated into protein, these genes were expressed in E. coli 
XL-1 Blue utilizing T7 RNA polymerase [44]. Expression of the sefA and sefA::PT3 
genes from pTZSef and pTZSPIO was demonstrated by Western blot analysis (Figure 

30 19A). Both the recombinant SefA::PT3 proteins and wild-type SefA proteins were 
expressed at comparable levels (Figure 19A, lanes 2 and 3) and two distinct 
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immunoreactive protein bands (14 kDa and 16-17 kDa) were seen in each lane. The 
larger protein was thought to be unprocessed wild-type or chimeric SefA possessing the 
22-amino acid signal sequence as observed previously [37]. The smaller protein was 
identified as the mature form of the fimbrin, since the band co-migrated with SefA from 
5 purified SEF14 (Figure 19 A, lane 4). Repeat of these blots with PT3 -specific immune 
serum did not reveal either the 14 or 16 kDa protein band in lane 3, suggesting that the 
PT3 epitope was inaccessible or not exposed on the surface of the chimeric SefA::PT3 
protein. 

Western blotting also showed expression of the agfA and agfA::PT3 

10 genes from pTZAgf and pTZAP7 in E. coli (Figure 19B). Both the recombinant 
AgfA::PT3 proteins and wild-type AgfA proteins were observed in SDS-PAGE sample 
buffer-glycine-insoluble material from whole cells treated with formic acid (Figure 
19B, lanes 5 and 6). Both were expressed at comparable levels and were the same size 
as mature AgfA from purified thin aggregative fimbriae (17 kDa; Figure 19B, lane 1), 

15 although a second 10 kDa band was seen for AgfA::PT3 (Figure 19B ? lane 6), 
suggesting that some degradation had occurred. AgfA was not detected in SDS-PAGE 
sample buffer-glycine extracts of whole cells (Figure 19B, lane 2). In contrast, 
AgfA::PT3 was observed in these samples (Figure 19B; lane 3), indicating that 
AgfA::PT3 was more soluble than wild-type AgfA. Repeat of these blots with PT3- 

20 specific immune serum confirmed that the bands in lanes 3 and 6 were indeed chimeric 
AgfA::PT3 protein. 

Chromosome-based expression of sefA::PT3 in S. enteritidis. Western 
blot analysis of the four S. enteritidis strains which were shown to contain either 
sefA::PT3 or sefA after gene replacement is shown in Figure 20, Strains containing 

25 sefA::PT3 (#13 and 192) were negative for the chimeric fimbrin protein SefA::PT3 
(Figure 20, lanes 3 and 4), whereas strains containing sefA (#1 1 and 197) were positive 
for the wild-type fimbrin protein SefA (Figure 20, lanes 2 and 5). All strains were 
positive for the expression of FimA (21 kDa), the subunit protein of Type 1 fimbriae 
(Figure 20, lanes 2-6). The predicted SefA::PT3 protein was not found when Westerns 

30 were repeated using a mixture of monoclonal antibodies against SefA. Furthermore, 
SefA::PT3 was not being secreted into the culture supernatant since trichloroacetic acid 
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(TCA) precipitation of S. enteritidis culture supernatant followed by SDS- 
PAGE/Western blotting did not reveal the recombinant protein. Further Western 
analysis of strain #13 confirmed that the strain was not producing SefA::PT3 under 16 
different growth conditions. Similar results were obtained for all recombinant S. 
5 enteritidis strains that were PCR-positive for the sefA::PT3 gene. These results 
demonstrated that S. enteritidis strains containing sefA::PT3 were not expressing the 

chimeric fimbrin protein. 

Chromosome-based expression of asfA::PT3 in S . enteritidis. Since S. 
enteritidis 3b forms distinct dark red, aggregative colonies when grown on T media 
10 with Congo Red (TCR) and expression of AgfA (thin aggregative fimbriae) is 
associated with both properties [10], all Ap s S. enteritidis strains obtained after 
agfA::PT3 gene replacement were streaked onto TCR plates. Based on previously 
established nomenclature [10], three distinct phenotypes were observed: a) red, 
aggregative (rough) (RR) colonies, b) orange, non-aggregative (smooth) (OS) colonies, 
15 and c) pink, non-aggregative (smooth) (PS) colonies; of 1 12 Ap s colonies obtained and 
characterized, 60 were RR, 39 were OS, and 13 were PS. The two S. enteritidis strains 
shown to contain wild-type agfA sequence (#91 and 104) were identified as being RR 
like S. enteritidis 3b. On the other hand, of the three S. enteritidis strains shown to 
contain agfA::PT3 sequence, two displayed the OS phenotype (#102 and 103), while the 
20 third displayed the PS phenotype (#27). 

Western blot analysis of these five S. enteritidis strains scraped from 
TCR plates (see Materials and Methods) is shown in Figure 21. As expected, the two 
RR strains expressed wild-type AgfA which was only present in SDS-PAGE sample 
buffer-glycine-insoluble material treated with 90% formic acid (Figure 21 B, lanes 2 and 
25 5), as demonstrated for S. enteritidis 3b (Figure 2 IB, lane 6). The two OS strains 
produced an immunoreactive 17 kDa protein (Figure 21 A or B, lanes 3 and 4), 
indicating they were producing chimeric AgfA::PT3 protein. Furthermore, the 17 kDa 
protein was present in both SDS-PAGE sample buffer-glycine extracts of whole cells 
and glycine-insoluble material treated with formic acid, confirming that AgfA::PT3 was 
30 more soluble than wild-type AgfA as previously observed in E. coli (above). 
Immunoreactive higher M r bands representing multimers of AgfA::PT3 were also 
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observed (Figure 21 B, lanes 3 and 4), suggesting that some polymerization of 
AgfA::PT3 subunits had occurred. In contrast to the OS strains analyzed, the PS strain 
#27 was negative for AgfA::PT3 when grown on TCR plates (Figure 22 A or B, lane 7). 

To investigate whether the S. enteritidis OS strains were secreting 
5 soluble AgfA::PT3 into the media, they were grown in T broth and the culture 
supernatant proteins were precipitated and analyzed by SDS-PAGE and Western 
blotting. Figure 22 represents replicate Western blots of these protein samples boiled in 
SDS-PAGE sample buffer with glycine, one reacted with immune serum generated 
against whole SEF17 (Figure 22, lanes 1-3) and the other reacted with immune serum 

10 generated against the PT3 peptide (Fig 22, lanes 4-6). Both OS strains revealed a 
soluble immunoreactive 18kDa protein which was recognized by both SEF17- and PT3- 
specific antiserum (Figure 22, lanes 2 and 5, 3 and 6), confirming that they were 
expressing chimeric AgfA:;PT3 and secreting large amounts into the media. Amino- 
terminal sequencing of the 18-kDa protein band from OS strain #102 yielded the 

15 GVVPQ sequence of mature AgfA [11], indicating that the size difference between 
these protein bands and wild-type AgfA from whole SEF17 (Figure 22 A, lane 1) was 
not due to a failure to cleave the signal sequence. Analysis of the PS strain under the 
same conditions revealed a faint immunoreactive band of similar size, suggesting that 
this strain was also expressing AgfA::PT3, but at a much lower level than the two OS 

20 strains. Like the two OS strains, amino terminal sequencing of this protein band 
yielded a major signal of GVVPQ. These results demonstrated the isolation of two 
genetically similar S. enteritidis strain types containing agfA::PT3 which express the 
chimeric fimbrin protein at different levels. 

The system for chromosomal gene replacement used for creation of 

25 Salmonella strains containing recombinant agfA was also used for creating Salmonella 
strains containing recombinant sefA. The sefA gene encodes SefA, the major subunit 
protein for SEF14 fimbriae produced by group D Salmonella spp. (Baumler and 
Heffron, 1995). SEF14 is a thin fimbriae like TAF, but unlike the TAF operon, the 
SEF14 operon encodes proteins which show homology to chaperone and usher proteins 

30 from other fimbrial systems (Clouthier et al, 1993). Therefore, SEF14 is proposed to 
assemble via the chaperone/usher pathway (Hultgren et al, 1996) and growth of the 
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fimbrial fiber is probably achieved from the base by addition of the subunits from the 
periplasmic side of the outer membrane (Mol and Oudega, 1996). 

SefA as a carrier of the PT3 epitope Ten different 16 amino acid 
segments within SefA were chosen for replacement with the PT3 epitope and are 
5 highlighted in Figure 23. These replacement regions were chosen using several criteria: 
primary sequence alignment, region SI (black); hydrophilicity, flexibility and 
accessibility plots, region S10 (orange); experimentally determined B-cell epitope 
regions, regions S2, S3, S4, and S5 (blue); and TnphoA mutagenesis data, regions S6, 
S7 5 S8 and S9 (green). In total, all 117 of 144 residues within SefA were replaced with 
1 0 the PT3 sequence in at least one of the chimeric fimbrin constructs. 

Generation of S. enteritidis strains containing the sefA::PT3 genes All 
recombinant sefA::PT3 genes were generated by PCR, sequenced and introduced into 
the chromosome of S. enteritidis, replacing the W* sefA gene, using the procedure 
described for agfA. 

15 Only S. enteritidis strains containing sefA::PT3 #1, 2, 4, 5, 6, 8, 9, 10 in 

the chromosome were obtained . Western blots were used to analyze S. enteritidis 
strains containing sefA::PT3 in the chromosome. The recombinant S. enteritidis strains 
SI, S2, S4, S5, S6, S8, S9, and S10 were grown on CFA plates at 37°C for 24 h and 
were analyzed for production of recombinant SefA proteins containing the PT3 epitope 

20 by SDS-PAGE and western blotting. 

In contrast to the results with agfA, NO recombinant SefA fimbrin 
proteins could be detected in any of the recombinant Salmonella strains containing 
recombinant sefA. These data demonstrated that SefA was not a suitable carrier of 
heterologous (foreign) peptide sequences and could not tolerate substitution of its 

25 residues from the N- to C-terminus of the protein. 

Since none of the recombinant SefA fimbrin proteins were able to be 
expressed in Salmonella, this provides evidence that fimbriae which assemble via the 
chaperone/usher pathway would not be good carriers of heterologous (foreign) peptide 
sequences. In addition, this finding further proves the utility and uniqueness of the 

30 AgfA/TAF presentation system described (i.e., the ability of AgfA to accept major 
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substitutions throughout its sequence and still have the ability to assemble into fimbrial 
fibers at the cell surface). 

Discussion 

5 The invention described herein provides an efficient and fascile strategy 

for the replacement of wild-type genes in the S. enteritidis chromosome with in vitro- 
altered versions. This gene replacement method allowed for efficient, site-specific 
chromosomal gene replacement without the need for selective markers in the target 
gene(s) or special bacterial strains for recombination, and avoided the introduction of 

10 non-target DNA sequences. The method was used to construct & enteritidis strains 
containing chimeric sefA and agfA fimbrin genes which carry foreign DNA encoding 
the PT3 epitope from L. major. The sefA::PT3 and agfA;:PT3 genes presented here 
represent the first chimeric fimbrin genes reported for the genus Salmonella. 

The gene replacement method reported in this study employs pHSG415, 

15 derived from a temperature-sensitive pSClOl replicon [20]. Although other gene 
replacement methods using temperature-sensitive pSClOl replicons [18,30] have been 
reported, their results were less favorable than those reported here. The method of 
Hamilton et ah [18] results in final strains containing freely replicating plasmids which 
must be cured before selecting gene replacements, whereas the present method results in 

20 a high proportion of final strains which have lost the excised plasmid altogether and can 
be screened directly by PCR. The method of Link et al [30] includes the Bacillus 
subtilis sacB system to enable direct selection for excision and loss of integrated 
plasmids. However, when creating deletion mutants of hdeA in E. coli without the use 
of selectable markers, these authors reported lower frequencies than are reported here 

25 (Table 8). In addition, the sacB system is complicated by the frequency of Suc R 
revertants [24,26]. 

A closer look at the properties of pHSG415 can help explain why this 
gene replacement method is so efficient. In contrast to pSClOl, pHSG415 was shown 
to be extremely unstable in liquid culture in the absence of selection pressure, with as 
30 few as 60% of cells carrying the plasmid after 24 hr [5]. These authors speculated that 
the difference between pSClOl and pHSG415 was due to the absence of most of the 
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par locus in pHSG415. The pSClOl par locus has been very well characterized; it has 
been shown to alter the binding of proteins within the origin region [23] and thus to 
enhance DNA replication [32]. In addition, it contains a preferential binding site for 
DNA gyrase [48] and alters the negative supercoiling of DNA [35]. Through a 

5 combination of these effects the par locus promotes partitioning of plasmids to daughter 
cells at the time of cell division [34]. Due to the deletion of the par locus, pHSG415 is 
proposed to have a defect in its segregating properties causing unequal partitioning of 
plasmid molecules to daughter cells at the time of cell division [5]. The present method 
uses the unstable properties of pHSG415 to advantage, and in the absence of selection 

10 pressure, integrated plasmids are lost immediately after the second crossover event or 
are retained for several generations and then lost. This would explain why an additional 
growth period of 48 hours (after the initial 72 hours) was sufficient to result in a high 
percentage of Ap s colonies when grown at 28°C. Thus, it is possible to replace sefA and 
agfA in the chromosome of a wild-type strain of S. enteritidis without any antibiotic 

15 resistance markers in the target genes at an efficiency of 30% and 10%, respectively. 

It is possible that the high gene replacement frequencies observed simply 
reflect the relative ease of sefA or agfA replacement. However, a recent study by 
Edwards et al [14] reported the replacement of sefA or agfA within S. enteritidis at 
lower frequencies than reported here, even with the use of a Km R marker in the 

20 recombinant genes. This supports the utility of the present method and demonstrates 
that the high replacement frequencies reported here are not simply due to the ease of 
sefA or agfA replacement. Therefore, this method has broad application and appeal for 
gene replacement in Salmonella and other enteric bacteria capable of supporting the 
replication of pSC 101 -derived plasmids. In addition, these results suggest that the use 

25 of similar "unstable" plasmids in other organisms could represent a more generalized 
mutagenesis strategy. 

The sefA::PT3 and agfA::PT3 genes presented here both carry a foreign 
DNA segment encoding PT3, a Tl^ -restricted T cell epitope comprising residues 154- 
168 from the GP63 protein of L. major [25]. This epitope was chosen because it was 

30 shown to be immunoprotective in Balb/c mice [41] and because studies with Salmonella 
vaccine strains expressing GP63 have demonstrated that oral vaccination against 
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Leishmania infection is possible [33]. Therefore, PT3 is a good candidate for 
developing and testing a heterologous fimbrial vaccine system in Salmonella. The 
subunit proteins of SEF14 and SEF17 fimbriae, SefA and AgfA, respectively, were 
chosen as the carriers of PT3 because they can be expressed at high levels and are 
5 surface-exposed [8,11], properties proven to be advantageous for other heterologous 
epitope presentation systems [28]. Moreover, SEF14 and SEF17 represent distinct 
classes of fimbriae in S. enteritidis 3b [31] and differ in their primary fimbrin sequence, 
probable tertiary structure, expression patterns, mode of assembly, and biochemical 
properties. 

10 Two S. enteritidis strains that contained sefA::PT3 were identified, 

genetically characterized and analyzed for expression of the chimeric fimbrin proteins. 
While SefA::PT3 protein was produced in E. coli, indicating that the sefA::PT3 gene 
construct had the correct coding sequence, the chimeric protein could not be detected in 
S. enteritidis. This could be due to a number of factors: alteration in the DNA topology 

15 of the chromosome effecting transcription initiation; production of an unstable RNA 
transcript; or expression of a significantly misfolded and rapidly degraded protein, due 
to the positioning of PT3 in the N-terminal region of SefA, a region important for 
dimerization of wild-type SefA subunits [6]. Experiments to identify more permissible 
sites for foreign epitope replacement within SefA are currently underway. 

20 Three S. enteritidis strains that contained agfA::PT3 were identified, 

genetically characterized and analyzed for expression of the chimeric fimbrin proteins. 
In contrast to the sefA::PT3 mutants, two of these strains were shown to produce the 
chimeric fimbrin protein at high levels. These strains displayed a distinct orange, non- 
aggregative (smooth) (OS) phenotype when plated on TCR medium, in contrast to the 

25 red, aggregative (rough) (RR) phenotype displayed by S. enteritidis 3b. Closer analysis 
of the AgfA::PT3 protein demonstrated that it had unexpected properties; whereas wild- 
type AgfA required pre-treatment in 90% formic acid to enter polyacrylamide gels, the 
chimeric AgfA::PT3 protein was readily seen by SDS-PAGE without formic acid 
treatment. The third strain containing agfA::PT3 had properties quite different than the 

30 other two. It produced much lower levels of chimeric AgfA::PT3 protein under the 
growth conditions tested and gave a third distinct phenotype when plated on TCR 
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medium, displaying pink, non-aggregative (smooth) (PS) colonies. This PS strain was 
shown to contain no sequence errors within agfA::PT3 and surrounding region (950 bp), 
thereby suggesting that gene replacement within the agfBAC operon of S. enteritidis can 
somehow affect fimbriae biosynthesis. The PS phenotype was similar to that previously 
5 reported for SEF17-deficient TnphoA mutants of S. enteritidis 3b [10]. Retailing et al. 
[38] observed similar pink, smooth colonies for S. typhimurium on CR media and 
showed that a single base pair change in the agfD promoter sequence was responsible 
for the mutant phenotype and lack of AgfA production. Allen- Vercoe et al. [1] have 
also recently described AgfA" S. enteritidis mutants with a pink, smooth phenotype on 

1 0 CR media and have characterized these strains as expressing a new, as yet undetermined 
fimbrial type. In light of these recent findings, the nature of the differences between the 
OS and PS agfA::PT3 strains are currently under investigation. 

SefA::PT3 and agfA::PT3 represent the first chimeric fimbrin genes 
reported for the genus Salmonella. These genes were replaced at the sites in the 

15 chromosome originally occupied by the wild-type fimbrin genes so as to maintain the 
appropriate relationship with any potential upstream regulatory sequences. Thus, the 
recombinant S. enteritidis strains that have been created are genetically identical to the 
strain with the exception of the 48 bp foreign DNA sequence. The gene 
replacement strategy presented allows for the rapid generation of similar S. enteritidis 

20 vaccine strains without the need for antibiotic resistance markers and/or the screening of 
thousands of colonies for a "rare" double crossover event. Strains created in this manner 
can then be used in future vaccine studies without the complications associated with 
extraneous DNA elements. 
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20 EXAMPLE 3 

DEMONSTRATION OF THE VACCINE POTENTIAL OF £ ENTERITIDIS 
DISPLAYING CHIMERIC FIMBRINS CONTAINING A T CELL EPITOPE FROM 

LEISHMANIA MAJOR. 

Introduction 

25 Several strains of & enteritidis carrying chromosomal chimeric fimbrin 

genes (agfA::PT3, above) were developed. The corresponding chimeric AgfA::PT3 
proteins were expressed and assembled into SEF17 fimbriae as described above. These 
strains display the PT3 T cell epitope from the gp63 protein of L. major in large 
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quantities at the S. enteritidis cell surface. Recent work with the PT3 epitope proved 
that a single immunization of peptide in adjuvant was able to protect BALB/c mice 
against challenge with L. major (Spitzer et ah, 1998). Thus, the S. enteritidis strains 
containing agfA::PT3 genes are promising candidates for the protection of BALB/c 
5 mice against L. major infection. 

Infection of BALB/c mice with L. major has been very well studied. 
The immune response generated in BALB/c mice in response to L. major infection is 
reviewed in Locksley et al (1999). To summarize, BALB/c mice are highly susceptible 
to L. major infection, whereas most other inbred mouse strains develop immunity and 

10 are able to control infection. Upon infection of BALB/c mice with L major the 
majority of parasite-specific CD4 T cells (T helper cells or Th) generated are of the type 
2 variety (Th2 cells) which push the immune response toward phagocyte-independent 
immunity and antibody production. In contrast, mice that resist L. major infection 
generate a majority of Thl cells which are required for activation of phagocyte- 

15 dependent immunity or cell-mediated immunity. The balance between susceptibility 
and resistance to L. major infection seems to rely on the relative numbers of these two 
Th subspecies. It should be noted that the PT3 epitope was demonstrated to specifically 
induce proliferation of CD4 T cells of the Thl subset (Jardim, 1994). 

Materials and Methods 

20 Animals. Inbred BALB/c mice were initially obtained from The Jackson 

Laboratory (Bar Harbor, ME) and bred at the McGill Research Institute (Montreal, 
Quebec). Mice used for L major vaccine trial were 8-10 weeks of age. 
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Bacterial Strains. Parasites. Media and Growth Conditions. S . enteritidis 
27655 strain 3b has been previously described (Feutrier et al, 1986). S. enteritidis 
strains A4, A5, and A8 were described above. Leishmania major carrying a reporter 
gene (Ha et al, 1996) to simplify measurement of parasite number was used as the 
5 virulent challenge strain. 

For analysis of fimbrial production, S. enteritidis A4, A5, and A8 were 
grown in 10 ml T broth (Collinson et al, 1991) or trypticase soy broth (TSB; Becton- 
Dickinson) statically or 200 rpm for 6, 24 or 48 h at 37 C. To prepares, enteritidis for 
immunization of BALB/c mice, cells were grown in 5 ml TSB for 6 h at 37°C, 250 rpm 
10 and 100 ul used to inoculate 100 ml T broth and incubated for a further 48 h at 37°C 
and 200 rpm. Cell density was determined by dilution plating onto trypticase soy agar 
(TSA). Subsequently, cultures were diluted to appropriate cell density in phosphate 
buffered saline (PBS) and used for immunization of BALB/c mice. 

L. major was grown in SDM-79 medium (Brun and Schonenberger, 
1 5 1 979) supplemented with 1 0% fetal bovine serum and 5 ug/mL hemin. 

Preparation of Protein Samples For SDS-PAGE. S. enteritidis A4, A5, 
A8 and 3b broth cultures were normalized to 1 Ag 00 and 1 ml cells were harvested, 
resuspended in 200 ul of SDS-PAGE sample buffer supplemented with 0.2 M glycine 
(pH 2), and boiled for 10 min. This cell extract was clarified by spinning in a 
20 microcentrifuge (13,200 rpm, 5 min.) and used directly for SDS-PAGE analysis. The 
insoluble glycine-extracted cell material recovered by centrifugation was washed with 
dH 2 0 and treated with 90% formic acid (Collinson, 1991; Collinson, 1993) before 
analysis by SDS-PAGE. 
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Gel Electrophoresis and Immunoblotting Procedures. SDS-PAGE was 
carried out according to the method of Laemmli (1970) with a 5% stacking gel and 12% 
resolving gel. Proteins separated by SDS-PAGE were electrophoretically transferred to 
nitrocellulose using an LKB Multiphor II Electrophoresis System (Pharmacia Biotech). 
5 Proteins were detected by Western blot techniques using SEF17-specific immune serum 
and goat-anti-rabbit immunoglobulin G-alkaline phosphatase conjugates (Cedarlane) as 
described previously (Collinson et al., 1991). 

Vaccination of Mice Withff. enteritidis . BALB/c mice were deprived of 
food 16-18 h and water 1 h before vaccination. Immediately before administration of 

10 Salmonella or PBS, mice were administered 50 pi of 1% sodium bicarbonate orally. 
Two groups of 15 mice (males and females) were each then immunized orally with 10 5 
CFU of S. enteritidis A4 or 3b in 200 pi PBS. Two groups of control animals received 
200 pi PBS only. Two weeks after the first vaccination, one S, enteritidis 3b group 
received a second dose of 10 3 S. enteritidis 3b whole cells in 200 pi PBS, one 5. 

15 enteritidis A4 group received a second dose of 10 3 S. enteritidis A4 whole cells in 200 
jal PBS, and one control group received 200 pi PBS only. All oral inoculations were 
carried out with a stainless steel gavage needle without an anaesthetic. Mice were 
monitored daily for signs of infection. 

Challenge With L. major, 5 vaccinated mice from each of the six 

20 vaccinated groups were challenged with 10 7 live L. major in 50 pi of PBS four or six 
weeks after the first vaccination or with 5 x 10 6 live L major in 50 pi of PBS eight 
weeks after first vaccination by subcutaneous injection in the right hind footpad. 
Infected footpad and noninfected footpad sizes were measured each week after I. major 
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infection until the end of the experiment using DigiMax electronic calipers. The net 
value for footpad inflammation (in millimeters) was calculated by subtracting the 
diameter of the noninfected footpad from the diameter of the infected footpad. 
Ulceration of infected footpads was also noted. 
5 At the end of the experiment, all mice were sacrificed by overexposure to 

C0 2 . Blood was collected by cardiac puncture and the spleen was removed. In 
addition, the popliteal ganglion from the right leg, the large lobe of the liver, the 
infected and noninfected hind paws were removed, frozen in liquid nitrogen and stored 
at "80°C until further analysis. The mouse blood samples were stored at 4°C for 24 h, 

10 clotted material was spun down for 20 min at 13,200 rpm, and the immune serum was 
removed and stored at "20°C until use. 

Estimation of Total Numbers of L. major in Infected Mouse Tissue. 
Estimation of total numbers of L. major in the infected popliteal ganglion was 
performed by a limiting dilution assay (Titus, R. G. et al 1985) and measurement of 

1 5 reporter gene activity following the procedure outlined by Berube et al (1996). 

Measurement of Antibody. PT3- or AgfA-specific antibodies were 
measured by the ELISA method (Engvall and Carlsson, 1976) involving 96-well DNA- 
bind plates (Corning-Costar) coated with either PT3 peptide, polymerized SEF17 or 
depolymerized SEF17 pre-treated with 90% formic acid at a concentration of 1 jag/well. 

20 Immune serum from each BALB/c mouse was tested in triplicate at a dilution of 1 in 
100. 

Statistics. Statistical significance was analyzed by determining the 
standard error of the mean (n = 5). If the mean values +/- standard error for different 
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groups of mice did not overlap, the results were considered significant (Duncan et aL, 
1977). 

Results 

5 Optimizing Expression of SEF17 Fimbrial Fibers Displaying the PT3 

Epitope. To determine the optimal conditions for expression of chimeric SEF17 fibers, 
S. enteritidis strains A4, A5, A8 were grown in TSB and T broth for 6, 24 or 48 h, static 
or shaking at 200 rpm. The cell pellets were harvested and the amount of cell- 
associated fimbrial material was determined by SDS-PAGE and immunoblotting. 

10 Strains A4 ? A5 and A8 all produced maximal amounts of cell-associated AgfA::PT3 
when grown for 48 h in T broth, shaking at 200 rpm (Table 9). The S. enteritidis 3b 
parental strain produced large amounts of cell-associated AgfA under most T broth 
conditions and was generally less variable in the amount detected with the cell pellet 
(Table 9). Cell-associated AgfA or AgfA::PT3 were not detected when strains were 

15 grown in TSB (data not shown). 

Strain A4 has been characterized further than other strains containing 
agfA::PT3 (data not shown). Therefore, S. enteritidis A4 was chosen to be tested as a 
vaccine against L. major. 

20 Table 9. Expression of cell-associated chimeric SEF17 under different growth 
conditions. 

Production of cell-associated SEF17 b 

6h 24 h 48 h 
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Strain a 0 rpm c 200 rpm 0 rpm 200 rpm Orpm 200 rpm 

3b - +++ +++ +++ +++ 

A4 - + ++ + ++ 

A5 - ++ + ++ +++ 

5 A8 - + + + +++ 



a S. enteritidis 27655-3b strains: parental strain 3b; agfA;:PT3 (A4, A5, A8). 
b S. enteritidis strains were assessed for production of cell-associated chimeric SEF17 
production after growth in T broth by SDS-PAGE and immunoblotting as described in 
10 Materials and Methods. Presence of AgfA::PT3 on immunoblots was scored as 
follows: equal to parental 3b AgfA levels (+++); slightly reduced levels (++); 
significantly reduced levels (+); not detectable (-) 
c 0 rpm refers to static growth and 200 rpm refers to shaken cultures. 

15 Vaccine Dosage and Route of Delivery. Preliminary experiments to test 

the virulence of S. enteritidis 3 b in BALB/c mice showed that a safe oral vaccination 
dose was 10 5 organisms/mouse. After administration of this dose to BALB/c mice, no 
deaths were recorded after 10 days and no CFUs of S. enteritidis were detected in the 
spleens and livers of mice after 28 days of infection (data not shown). 

20 Vaccination of BALB/c Mice and Subsequent Challenge With Z. major. 

The experimental outline for S. enteritidis vaccination and L. major challenge is shown 
in Table 10. BALB/c mice were vaccinated once with 10 5 cells of S. enteritidis A4 or 
3b. Control mice were administered an equal volume of PBS orally. Two weeks later, 
selected groups of mice received a second oral dose of 10 3 cells of S. enteritidis A4 or 

25 3b. Vaccinated mice were challenged in the footpad with L. major either 4, 6 or 8 
weeks after the first vaccination and footpad swelling was monitored. Four weeks after 
Z. major challenge, group 1 mice (challenged 4 weeks after first vaccination) were 
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sacrificed; there were no major differences in footpad inflammation between any of the 
mice (data not shown). There was notable increase in footpad inflammation with group 
2 and 3 mice vaccinated with S. enteritidis A4 as compared to PBS-vaccinated control 
mice (Fig. 24 A-D). The largest difference between these two groups was observed in 
5 group 3 mice challenged with Z. major 8 weeks after vaccination (Fig. 24 C, D). In 
general, footpad inflammation in mice vaccinated with 5. enteritidis 3b or A4 was very 
similar. 

Table 10. S. enteritidis vaccination of BALB/c mice and challenge with L. major. 



Inoculum 3 


Vaccination b 




Leishmania major challenge 0 




DayO 


Day 14 Week 4 


Week 6 


Week 8 


PBS 


15 d 




5 


5 


5 


3b 


15 




5 


5 


5 


A4 


15 




5 


5 


5 


PBS 


15 


15 


5 


5 


5 


3b 


15 


15 


5 


5 


5 


A4 


15 


15 


5 


5 


5 



20 a S. enteritidis 27655-3b strains: parental strain 3b; A4 (agfA::PT3#4). 

b Oral immunization of BALB/c mice with S. enteritidis 3b or A4 by gavage: 10 5 CFU 
in 200 |il PBS administered at Day 0, 10 3 CFU in 200 \il PBS administered at Day 14. 
0 Subcutaneous injection of live L. major into the right hind footpad of vaccinated 
BALB/c mice: 10 7 parasites in 50 pi PBS administered at Week 4 or 6, 5 x 10 6 parasites 

25 in 50 yd PBS administered at Week 8. 

d Numbers listed in table refer to numbers of BALB/c mice in each group. 



Parasite Load in the Popliteal Lymph Nodes. At the end of the L. major 
challenge experiment, mice were sacrificed and the right popliteal lymph nodes 
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removed. Lymph node cells were homogenized and parasite load was evaluated by 
limited dilution assay (Titus et ah, 1985) and measurement of reporter gene activity. 
Total reporter gene units and reporter gene units per mg of organ are represented for the 
30 group 3 BALB/c mice challenged with L. major 8 weeks after vaccination (Fig. 25 
5 A, B). Popliteal lymph nodes from mice vaccinated once with S. enter itidis A4 had 
significantly lower reporter gene activity than the PBS-vaccinated control mice (Fig. 
25B; group 3-3 versus 3-1). However, reporter gene activity levels for group 3-3 were 
higher than mice from control group 3-2, which received one dose of wild-type S. 
enteritidis 3b. More promising results were obtained for mice from group 3-6, which 

10 received two doses of the vaccine strain, S. enteritidis A4. Reporter gene activity levels 
in the lymph node tissue from these mice were significantly lower than both the PBS- 
vaccinated and S. enteritidis 3b-vaccinated groups of mice (Fig. 25B, groups 3-4, 3-5). 
This indicated that two vaccinations of BALB/c mice with & enteritidis A4, which 
displays the L. major PT3 epitope at the cell surface, was enough to reduce the overall 

1 5 numbers of parasites. 

Measurement of Antibody Levels in Mouse Serum. Immune sera from 
infected BALB/c mice were screened by ELISA for recognition of antigens carried by 
the S. enteritidis vaccine strain A4, specifically PT3 peptide and SEF17/AgfA (Fig. 26 
A-C). The strongest antibody response against PT3 (Fig. 26A) and polymerized SEF17 

20 (Fig. 26B) was observed in mice from group 3-6, the group which received two 
vaccinations of S. enteritidis A4. These mean values were boosted by the strong 
response of one individual mouse in this group (data not shown). When immune sera 
were tested against SEF17 fimbriae which had been depolymerized into AgfA 
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monomers by treatment with 90% formic acid (Fig. 26C), group 3-6 again had a strong 
antibody response. These results indicated that several mice vaccinated with S. 
enteritidis A4 had produced antibodies specific for chimeric SEF17 fimbriae carrying 
the PT3 eptiope. 

5 

Discussion 

In summary, two oral vaccinations with S. enteritidis A4 ? displaying 
chimeric fimbriae containing the PT3 epitope from L. major gp63 ? caused a reduction in 
overall numbers of L. major in the popliteal lymph nodes with respect to S. enteritidis 

10 3b- or PBS-vaccinated control mice. This was recorded even though footpad 
inflammation had significantly increased in the S. enteritidis A4 vaccinated mice with 
respect to control mice. Also, mice from the S. enteritidis A4 vaccinated group 
developed a significant humoral immune response against both the PT3 epitope as well 
as the AgfA protein which was carrying it. Therefore, there is clear evidence that 

15 presentation of the PT3 epitope in the context of SEF17 by S. enteritidis A4 was 
successful. 

Several studies have reported vaccination of BALB/c mice against 
Leishmania by administering recombinant Salmonella vaccine strains producing the 
gp63 protein from L. major (Gonzalez et al., 1998; McSorley et al. 9 1997; Xu et aL, 
20 1995). The authors correlate a decrease in footpad inflammation after L. major 
challenge with decreased parasite loads and protection against L. major infection. 
Olivier et al (1998) correlated decreased footpad inflammation in vaccinated BALB/c 
mice with a drop in parasite load in the popliteal lymph nodes. Here a similar drop in 
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parasite load in the popliteal ganglion of vaccinated mice occurred, but footpad 

inflammation had increased. 

These differences may be due to the use of S. enteritidis as the carrier 

vaccine strain. 5. enteritidis has been widely studied as a vaccine candidate chiefly to 
5 immunize poultry in a effort to reduce Salmonella load in chicken and egg products 

destined for human consumption. S. enteriditis can produce a sublethal, chronic 

infection in mice that is cleared over time (unpublished data), however, this Salmonella 

serotype is not normally studied as a carrier of heterologous antigens for vaccine 

efficacy trials in mice. It is not well characterized whether S. enteritidis will reach the 
10 appropriate mouse tissues to achieve optimum immune response against foreign 

epitopes which it is carrying. Hence, it is unknown what effect oral vaccination with S. 

enteritidis has on the BALB/c mice prior to challenge with L. major. In addition, S. 

enteritidis 3b has not been specifically attenuated for vaccine purposes. 

Of all the Salmonella serovars available, S. typhimurium, the causative 
15 agent of murine typhoid, and S. typhi, the causative agent of human typhoid are the 

most well characterized with respect to vaccine development (Chatfield et aL, 1992; 

Chatfield et a/., 1993; Curtiss et aL, 1994; Roberts et a!., 1994; Levine et al., 1996). 

Presentation of our chimeric fimbriae in the context of one of these well-characterized 

vaccine strains may lead to even greater success with the epitope delivery system 
20 described herein. Assembly of chimeric SEF17 fimbriae in these strains would also 

occur since the agfA operon and assembly systems are highly conserved among the 

Salmonellae (Romling et al 9 1998; Baumler et al 9 1997; Collinson et al., 1996). 

Nevertheless, vaccination of BALB/c mice with S. enteritidis presenting the Leishmania 
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PT3 epitope resulted in generation of a specific immune response against the 
heterologous epitope and an overall reduction of L. major cell numbers in the 
vaccinated host. 
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CLAIMS 

1. A recombinant agfA gene in which a segment of the gene has been 
replaced by a segment of a foreign DNA sequence, that foreign sequence encoding a foreign 
epitope or antigen. 

2. A recombinant agfA gene in which one or more segments of the gene 
have been replaced by segments of foreign DNA sequence(s), the foreign sequence(s) 
encoding a foreign epitope(s) or antigen(s). 

3. The recombinant gene according to any one of claims 1 and 2 wherein 
the said recombinant agfA gene is present in an expression vector construct capable of 
producing recombinant AgfA fimbrin proteins in a strain of Salmonella. 

4. The recombinant gene according to any one of claims 1 and 2 wherein 
the said recombinant agfA gene is present in an expression vector construct capable of 
producing recombinant AgfA fimbrin proteins in a strain of E. coli. 

5. The recombinant gene according to any one of claims 1 and 2 wherein 
the said recombinant agfA gene is present in an expression vector construct capable of 
producing recombinant AgfA fimbrin proteins in a strain of Enerobacteriaceae. 

6. The recombinant gene according to any one of claims 1 and 2 wherein 
the said recombinant agfA gene is present in an expression vector construct capable of 
producing stable fimbriae comprising recombinant AgfA protein in a strain of Salmonella. 

7. The recombinant gene according to any one of claims 1 and 2 wherein 
the said recombinant agfA gene is present in an expression vector construct capable of 
producing stable fimbriae comprising recombinant AgfA protein in a strain of E. coli. 
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8. The recombinant gene according to any one of claims 1 and 2 wherein 
the said recombinant agfA gene is present in an expression vector construct capable of 
producing stable fimbriae comprising recombinant AgfA protein in a strain of 
Enterobacteriaceae . 

9. The recombinant gene according to any one of claims 1-3, 6 wherein 
the said recombinant agfA gene is present in the chromosome of a strain of Salmonella. 

10. The recombinant gene according to any one of claims 1-3, 6 wherein 
the said recombinant agfA gene is present in the chromosome of a strain of Salmonella, as a 
replacement of the native agfA gene. 

11. The recombinant gene according to any one of claims 1,2,4,7 wherein 
the said recombinant agfA gene is present in the chromosome of a strain of E. coll 

12. The recombinant gene according to any one of claims 1,2,4,7 wherein 
the said recombinant agfA gene is present in the chromosome of a strain of E. coll as a 
replacement of the native csgA gene. 

13. The recombinant gene according to any one of claims 1,2,5,8 wherein 
the said recombinant agfA gene is present in the chromosome of a strain of 
Enterobacteriaceae, 

14. The recombinant gene according to any one of claims 1,2,5,8 wherein 
the said recombinant agfA gene is present in the chromosome of a strain of 
Enterobacteriaceae as a replacement of the native homologous gene. 

15. The recombinant gene according to any one of claims 1-14 wherein 
said recombinant gene is an agfB recombinant gene. 
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16. The recombinant gene according to any one of claims 1-14 wherein 
said recombinant gene is an csgA recombinant gene. 

17. The recombinant gene according to any one of claims 1-14 wherein 
said recombinant gene is an csgB recombinant gene. 

18. The recombinant gene according to any one of claims 1-14 wherein 
said recombinant gene is an Enterobacteriaceae homologue of the agfA or agfB recombinant 
genes. 

19. The recombinant gene according to any one of claims 9, 10 wherein 
said recombinant strain of Salmonella is capable of producing recombinant AgfA fimbrin 
proteins. 

20. The recombinant gene according to any one of claims 11, 12 wherein 
said recombinant strain of E. coli is capable of producing recombinant AgfA fimbrin proteins. 

21. The recombinant gene according to any one of claims 13, 14 wherein 
said recombinant strain of Enterobacteriaceae is capable of producing recombinant AgfA 
fimbrin proteins. 

22. The recombinant gene according to any one of claims 19-21 wherein 
said recombinant fimbrin protein is a recombinant AgfB fimbrin protein. 

23. The recombinant gene according to any one of claims 19-21 wherein 
said recombinant fimbrin protein is a recombinant CsgA fimbrin protein. 

24. The recombinant gene according to any one of claims 19-21 wherein 
said recombinant fimbrin protein is a recombinant CsgB fimbrin protein. 
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25. The recombinant gene according to any one of claims 19-21 wherein 
said recombinant fimbrin protein is a recombinant homologue of an AgfA or AgfB fimbrin 
protein. 

26. The recombinant gene according to any one of claims 9, 10 wherein 
said recombinant strain of Salmonella is capable of producing stable fimbriae comprising 
recombinant AgfA fimbrin protein. 

27. The recombinant gene according to any one of claims 11, 12 wherein 
said recombinant strain of E. coli is capable of producing stable fimbriae comprising 
recombinant AgfA fimbrin protein. 

28. The recombinant gene according to any one of claims 13, 14 wherein 
said recombinant strain of Enterobacteriaceae is capable of producing stable fimbriae 
comprising recombinant AgfA fimbrin protein. 

29. The recombinant gene according to any one of claims 26-28 wherein 
said fimbriae are comprised of recombinant AgfB fimbrin protein. 

30. The recombinant gene according to any one of claims 26-28 wherein 
said fimbriae are comprised of recombinant CsgA fimbrin protein. 

31. The recombinant gene according to any one of claims 26-28 wherein 
said fimbriae are comprised of recombinant CsgB fimbrin protein. 

32. The recombinant gene according to any one of claims 26-28 wherein 
said fimbriae are comprised of recombinant AgfA and AgfB fimbrin proteins. 

33. The recombinant gene according to any one of claims 26-28 wherein 
said fimbriae are comprised of recombinant CgsA and CsgB fimbrin proteins. 
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34. The recombinant gene according to any one of claims 26-28 wherein 
said fimbriae are comprised of recombinant homologue of an AgfA or AgfB fimbrin protein. 

35. The use of the SEF17/TAF nucleation dependent assembly system of 
strains of Salmonella for the production of fimbriae comprising recombinant AgfA subunits. 

36. The use of the SEF17/TAF homologue curli nucleation dependent 
assembly system of strains of E. coli for the production of fimbriae comprising recombinant 
CsgA subunits. 

37. The use of the SEF17/TAF homlogues nucleation dependent assembly 
systems of strains of Enterobacteriaceae for the production of fimbriae comprising 
recombinant, AgfA-homologue fimbrin subunits. 

38. A method of eliciting an immune response in an animal comprising: 

(a) separating an amino acid polymer comprising a recombinant AgfA 
protein containing a replacement segment or segments of foreign amino acid sequence or 
sequences grown on a Salmonella host cell from said Salmonella host cell; and 

(b) introducing said amino acid polymer into said animal in conjunction 
with a physiological carrier or diluent. 

39. A method of eliciting an immune response in an animal comprising: 

(a) separating an amino acid polymer comprising a recombinant AgfA 
protein containing a replacement segment or segments of foreign aminoacid sequence or 

sequences grown on a E. coli host cell from said E. coli host cell; and 

(b) introducing said amino acid polymer into said animal in conjunction 
with a physiological carrier or diluent. 



40. A method of eliciting an immune response in an animal comprising: 
(a) separating an amino acid polymer comprising a recombinant AgfA protein 
containing a replacement segment or segments of foreign aminoacid sequence or sequences 
grown on a Enterobacteriaceae host cell from said Enterobacteriaceae host cell; and 
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(b) introducing said amino acid polymer into said animal in conjunction with a 
physiological carrier or diluent. 

41. The recombinant gene according to any one of claims 38-40 wherein 
said recombinant protein is AgfB recombinant protein. 

42. The recombinant gene according to any one of claims 38-40 wherein 
said recombinant protein is CsgA recombinant protein. 

43. The recombinant gene according to any one of claims 38-40 wherein 
said recombinant protein is CsgB recombinant protein. 

44. The recombinant gene according to any one of claims 38-40 wherein 
said recombinant protein is an AgfA or AgfB homologue recombinant protein. 

45. A method of eliciting an immune response in an animal comprising: 

(a) separating a fimbriae comprising a recombinant AgfA protein 
containing a replacement segment or segments of foreign amino acid sequence or sequences 
grown on a Salmonella host cell from said Salmonella host cell; and 

(b) introducing said fimbriae into said animal in conjunction with a 
physiological carrier or diluent. 

46. A method of eliciting an immune response in an animal comprising: 

(a) separating a fimbriae comprising a recombinant AgfA protein 
containing a replacement segment or segments of foreign amino acid sequence or sequences 
grown on a E. coli host cell from said E. coli host cell; and 

(b) introducing said fimbriae into said animal in conjunction with a 
physiological carrier or diluent. 
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47. A method of eliciting an immune response in an animal comprising: 

(a) separating a fimbriae comprising a recombinant AgfA protein 
containing a replacement segment or segments of foreign amino acid sequence or sequences 
grown on a Enterobacteriaceae host cell from said Enterobacteriaceae host cell; and 

(b) introducing said fimbriae into said animal in conjunction with a 
physiological carrier or diluent. 

48. The recombinant gene according to any one of claims 45-47 wherein 
said fimbriae is comprised of an AgfB recombinant protein. 

49. The recombinant gene according to any one of claims 45-47 wherein 
said fimbriae is comprised of an CsgA recombinant protein. 

50. The recombinant gene according to any one of claims 45-47 wherein 
said fimbriae is comprised of an CsgB recombinant protein. 

51. The recombinant gene according to any one of claims 45-47 wherein 
said fimbriae is comprised of an AgfA or AgfB homologue recombinant protein. 

52. The recombinant gene according to any one of claims 45-47 wherein 
said fimbriae is comprised of an AgfA and AgfB homologue recombinant protein. 

53. The recombinant gene according to any one of claims 45-47 wherein 
said fimbriae is comprised of an CsgA and CsgB homologue recombinant protein. 

54. A method for directing recombination of a recombinant gene into the 
chromosome of the homologous species. 

55. A method for directing recombination of a recombinant gene back into 
the chromosome of the homologous species thereby replacing the native copy of that gene. 
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BACTERIAL FIMBRIAL SYSTEM FOR PRESENTATION OF HETEROLOGOUS 

PEPTIDE SEQUENCES 

ABSTRACT OF THE DISCLOSURE 

The invention provides a system for creating recombinant agfA fimbrin genes 
and performing chromosomal gene replacements within Salmonella, creating Salmonella 
strains which carry the recombinant agfA genes at the native position in the chromosome. 
One embodiment of the invention is exemplified by the expression of a model epitope (PT3) 
obtained from the GP63 protein of Leishmania major, by formation of recombinant agfA 
genes encoding PT3 fusion proteins recombined at 10 different sites throughout the agfA 
gene. These fusions are shown to be expressed in the thin aggregative fimbriae on the surface 
of bacterial cell. The AgfA fimbrin of Salmonella (CsgA for E. coli) provides a flexible and 
stable vehicle for the expression of foreign epitopes in enterobacteriaceae and the subsequent 
thin aggregative fimbriae (curli) expression product provide an ideal organelle for 
presentation of the foreign epitopes at the cell surface. 
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Weeks postL major infection 
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SEQUENCES 

SEQ ID #1 

agfA DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
GCCCGTAAATCTGAAACGACCATTACCCAGAGCGGTTATGGTAACGGCGCCGATGTAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCATCG 
ACCAGTGGAACGCTAAAAACTCCGATATTACTGTCGGCCAATACGGCGGTAATAACGCCGCG 
CTGGTTAATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 

CGCCACGGCTAACCAGTATTAA 

SEQ ID #2 

agfB DNA Sequence 

ATGAAAAACAAATTGTTATTTATGATGTTGACAATACTGGGTGCGCCTGGGATTGCAACCGC 
GACAAATTATGATCTGGCTCGTTCAGAATATAATTTTGCGGTAAATGAATTAAGCAAGTCTT 
CATTTAATCAGGCGGCCATTATTGGTCAAGTCGGCACGGATAATAGTGCCAGAGTACGCCAG 
GAAGGATCAAAACTATTGTCCGTTATTTCACAAGAAGGAGGAAATAATCGGGCGAAAGTCGA 
CCAGGCAGGGAATTATAACTTTGCGTATATTGAGCAAACGGGCAATGCCAACGATGCCAGTA 
TATCGCAAAGCGCTTACGGTAATAGTGCAGCTATTATCCAGAAAGGTTCTGGAAATAAGGCC 
AATATTACCCAGTACGGTACGCAGAAAACAGCAGTTGTAGTGCAGAAACAGTCGCATATGGC 

TATTCGCGTCACCCAACGCTAA 

SEQ ID #3 

csgA DNA Sequence 

ATGAAACTTTTAAAAGTAGAAGCAATTGCAGCAATCGTATTCTCCGGTAGCGCTCTGGCAGG 
TGTTGTTCCTCAGTACGGCGGCGGCGGTAACCACGGTGGTGGCGGTAATAATAGCGGCCCAA 
ATTCTGAGCTGAACATTTACCAGTACGGTGGCGGTAACTCTGCACTTGCTCTGCAAACTGAT 
GCCCGTAACTCTGACTTGACTATTACCCAGCATGGCGGCGGTAATGGTGCAGATGTTGGTCA 
GGGCTCAGATGACAGCTCAATCGATCTGACCCAACGTGGCTTCGGTAACAGCGCTACTCTTG 
ATCAGTGGAACGGCAAAAATTCTGAAATGACGGTTAAACAGTTCGGTGGTGGCAACGGTGCT 
GCAGTTGACCAGACTGCATCTAACTCCTCCGTCAACGTGACTCAGGTTGGCTTTGGTAACAA 

CGCGACCGCTCATCAGTACTAA 

SEQ ID #4 

csgB DNA Sequence 

ATGAAAAACAAATTGTTATTTATGATGTTAACAATACTGGGTGCGCCTGGGATTGCAGCCGC 
AGCAGGTTATGATTTAGCTAATTCAGAATATAACTTCGCGGTAAATGAATTGAGTAAGTCTT 
CATTTAATCAGGCAGCCATAATTGGTCAAGCTGGGACTAATAATAGTGCTCAGTTACGGCAG 
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GGAGGCTCAAAACTTTTGGCGGTTGTTGCGCAAGAAGGTAGTAGCAACCGGGCAAAGATTGA 
CCAGACAGGAGATTATAACCTTGCATATATTGATCAGGCGGGCAGTGCCAACGATGCCAGTA 
TTTCGCAAGGTGCTTATGGTAATACTGCGATGATTATCCAGAAAGGTTCTGGTAATAAAGCA 
AATATTACACAGTATGGTACTCAAAAAACGGCAATTGTAGTGCAGAGACAGTCGCAAATGGC 
TATTCGCGTGACACAACGTTAA 

SEQ ID #5 

AgfA protein Sequence 

MKLLKVAAFAAIWSGSALAGWPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSD 
ARKSETT I TQSGYGNGADVGQGADNST I ELTQNGFRNNAT IDQWNAKNSD I TVGQYGGNNAA 
LVNQTASDSSVMVRQVGFGNNATANQY 

SEQ ID #6 

AgfB protein Sequence 

MKNKLLFMMLTILGAPGIATATNYDLARSEYNFAVNELSKSSFNQAAIIGQVGTDNSARVRQ 
EGSKLLSVISQEGGNNRAKVDQAGNYNFAYIEQTGNANDASISQSAYGNSAAIIQKGSGNKA 

N I TQ YGTQKTAVWQKQSHMAI RVTQR 
SEQ ID #7 

CsgA protein sequence 

MKLLKVAAIAAIVFSGSALAGWPQYGGGGNHGGGGNNSGPNSELNIYQYGGGNSALALQTD 
ARNSDLTITQHGGGNGADVGQGSDDSSIDLTQRGFGNSATLDQWNGKNSEMTVKQFGGGNGA 

AVDQTASNSSVNVTQVGFGNNATAHQY 
SEQ ID #8 

CsgB protein sequence 

MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAGTNNSAQLRQ 
GGS KLLAWAQE GS SNRAK I DQTGD YNLAY I DQAGSAND AS I SQGAYGNTAM 1 1 QKGS GNKA 
N I TQYGTQKTAI WQRQS QMAI RVTQR 

SEQ ID #9 

PT3 epitope DNA sequence 

TATGATCAGCTGGTTACCCGTGTTGTTACCCATGAAATGGCACATGCA 
SEQ ID #10 

PT3 epitope protein sequence 
YDQLVTRWTHEMAHA 

SEQ ID #11 

AgfA::PT3#l DNA sequence 
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ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
GCCCGTAAATCTGAAACGACCATTACCCAGAGCGGTTATGGTAACGGCGCCGATGTAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCATCG 
ACCAGTGGAACGCTAAAAACTCCGATATTACTGTCGGCCAATACGGCGGTAATAACGCCGCG 
CTGGTTAAT TATGATCAGCTGGTTACCCGTGTTGTTACCCATGAA ATGGCACATGCAAACAA 

CGCCACGGCTAACCAGTATTAA 
SEQ ID #12 

AgfA::PT3#l protein sequence 

MKLLKVAAFAAIWSGSALAGWPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSD 
ARKSETTITQSGYGNGADVGQGADNSTIELTQNGFRNNATIDQWNAKNSDITVGQYGGNNAA 

T iVNVn pT iVTRVVTHEMAHAN NATANQY 
SEQ ID #13 

AgfA::PT3#2 DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
GCCCGTAAATCTGAAACGACCATTACCCAGAGCGGTTATGGTAACGGCGCCGATGTAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCATCG 

.\C(r-CTCC.V-?Ci / 7T? 1 ft ft ft Ar. T frv2ATaTTarTnTPfy}rr&&TATPtATC!AQCTGGTTACCCGT 
GTTGTTACCCATGAAATGGCACATGCA AGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 

CGCCACGGCTAACCAGTATTAA 
SEQ ID #14 

AgfA::PT3#2 protein sequence 

MKLLKVAAFAAIWSGSALAGWPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSD 
ARKSETT I TQSGYGNGADVGQGADNS T I ELTQNGFRNNAT I DQWNAKNS D I TVGQ YDQLVTR 
WTHEMAHASVMVRQVGFGNNATANQY 



SEQ ID #15 

AgfA::PT3#3 DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTC TATGATCAGCTGGTTACCCGTGTTGTTACCCATGAAATGGCACATGCA TCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
GCCCGTAAATCTGAAACGACCATTACCCAGAGCGGTTATGGTAACGGCGCCGATGTAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCATCG 
ACCAGTGGAACGCTAAAAACTCCGATATTACTGTCGGCCAATACGGCGGTAATAACGCCGCG 
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CTGGTTAATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 
CGCCACGGCTAACCAGTATTAA 

SEQ ID #16 

AgfA::PT3#3 protein sequence 

M yT.T.T^7aaffaaT^A^.qa.qAT,A(? VYDOLVTRWTHEMAHA SGPDSTLSIYQYGSANAALALQSD 
ARKSETTITQSGYGNGADVGQGADNSTIELTQNGFRNNATIDQWNAKNSDITVGQYGGNNAA 

LVNQTASD S S VMVRQVGFGNNAT ANQY 
SEQ ID #17 

AgfA: :PT3#4 DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGGGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ar TATGATCAGCTGGTTACCCGTGTTGTTACCCATGAAATGGCACATGCA CTGCAAAGCGAT 
GCCCGTAAATCTGAAACGACCATTACCCAGAGCGGTTATGGTAACGGCGCCGATGTAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCATCG 
ACCAGTGGAACGCTAAAAACTCCGATATTACTGTCGGCCAATACGGCGGTAATAACGCCGCG 
CTGGTTAATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 

CGCCACGGCTAACCAGTATTAA 
SEQ ID #18 

AgfA::PT3#4 protein sequence 

ARKSETTITQSGYGNGADVGQGADNSTIELTQNGFRNNATIDQWNAKNSDITVGQYGGNNAA 
LVNQTASD S S VMVRQVGFGNNATANQY 

SEQ ID #19 

AgfA::PT3#5 DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
ftCCCGTAA ATATGATCAGCTGGTTACCCGTGTTGTTACCCATG AAATGGCACATGCAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCATCG 
ACCAGTGGAACGCTAAAAACTCCGATATTACTGTCGGCCAATACGGCGGTAATAACGCCGCG 
CTGGTTAATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 

CGCCACGGCTAACCAGTATTAA 
SEQ ID #20 

AgfA::PT3#5 protein sequence 
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MKLLKVAAFAAIWSGSALAGWPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSD 
AP T CYDOT iVTRWTHEMAHA GOGADNST I ELTQNGFRNNAT I DQWNAKNSD I TVGQYGGNNAA 
LWQTASDSSVMVRQVGFGNNATANQY 

SEQ ID #21 

AgfA: :PT3#6 DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
GCCCGTAAATCTGAAACGACCATTACCCAGAGCGGTTATGGTAACGGCGCCGATGTAGGCCA 
r,nr,Tr?rr^ATAAT TATGATCAGCTGGTTACCCGTGTTGTTACCCAT GAAATGGCACATGCAG 
ACCAGTGGAACGCTAAAAACTCCGATATTACTGTCGGCCAATACGGCGGTAATAACGCCGCG 
CTGGTTAATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 

CGCCACGGCTAACCAGTATTAA 
SEQ ID #22 

AgfA::PT3#6 protein sequence 

MKLLKVAAFAAIWSGSALAGWPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSD 
ARKS T?^TTTn.c;nvnMnAnvmGAnW YDOT,VTRVVTHEMAHA DOWNAKNSDI TVGQYGGNNAA 
LVNQTASD S S VMVRQVGFGNNATANQY 

SEQ ID #23 

AgfA::PT3#7 DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
GCCCGTAAATCTGAAACGACCATTACCCAGAGCGGTTATGGTAACGGCGCCGATGTAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCATCG 
APPartTnnAAPGrTAAAAAC TATGATCAGCTGGTTACCCGTGTTGTTACCCAT GAAATGGCA 
CATGCAA ATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 

CGCCACGGCTAACCAGTATTAA 
SEQ ID #24 

AgfA::PT3#7 protein sequence 

MKLLKVAAFAAI WSGSALAGVVPQWGGGGNHNGGGNS SGPDSTLSI YQ YGS ANAALALQSD 
ARKSETT ITQSGYGNGADVGQGADNST I ELTQNGFRNNAT I DOWNATCNYDOLVTRVVTHEMA 
HANQTASDSSVMVRQVGFGNNATANQY 

SEQ ID #25 

AgfA::PT3#8 DNA sequence 
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ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTTATGATCAG^TGGTT 
ACCCGTGTTGTTACCCATGAAATGGCACATGCA GGTTATGGTAACGGCGCCGATGTAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCATCG 
ACCAGTGGAACGCTAAAAACTCCGATATTACTGTCGGCCAATACGGCGGTAATAACGCCGCG 
CTGGTTAATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 

CGCCACGGCTAACCAGTATTAA 
SEQ ID #26 

AgfA: :PT3#8 protein sequence 

MKLLKVAAFAAIWSGSALAGWPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALYDQLV 
TRWTHEMAHA GYGNGADVGQGADNSTIELTQNGFRNNATIDQWNAKNSDITVGQYGGMNAA 

LVNQTASDSSVMVRQVGFGNNATANQY 
SEQ ID #27 

AgfA::PT3#9 DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
GC C CGT AAAT CTGAAACGAC CATTACC CAGAGCGGTT ATGGTAACGGCGCCGATTATGAT CA 
GCTGGTTACCCGTGTTGTTACCCATGAAATGGCACATGCA TTCAGAAATAATGCCACCATCG 
ACCAGTGGAACGCTAAAAACTCCGATATTACTGTCGGCCAATACGGCGGTAATAACGCCGCG 
CTGGTTAATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 

CGCCACGGCTAACCAGTATTAA 
SEQ ID #28 

AgfA::PT3#9 protein sequence 

MKLLKVAAFAAIWSGSALAGWPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSD 
ARKS ETT I TQ SGYGNGAD YDOLVTRWTHEMAHA FRNNAT I DQWNAKNS D I TVGQYGGNNAA 
LVNQTASDSSVMVRQVGFGNNATANQY 

SEQ ID #29 

AgfA: :PT3#10 DNA sequence 

ATGAAACTTTTAAAAGTGGCAGCATTCGCAGCAATCGTAGTTTCTGGCAGTGCTCTGGCTGG 
CGTCGTTCCACAATGGGGCGGCGGCGGTAATCATAACGGCGGCGGCAATAGTTCCGGCCCGG 
ACTCAACGTTGAGCATTTATCAGTACGGTTCCGCTAACGCTGCGCTTGCTCTGCAAAGCGAT 
GCCCGTAAATCTGAAACGACCATTACCCAGAGCGGTTATGGTAACGGCGCCGATGTAGGCCA 
GGGTGCGGATAATAGTACTATTGAACTGACTCAGAATGGTTTCAGAAATAATGCCACCTATG 
ATCAGCTGGTTACCCGTGTTGTTACCCATGAAATGGCACATGCAGGCGGTAATAACGCCGCG 
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CTGGTTAATCAGACCGCATCTGATTCCAGCGTAATGGTGCGTCAGGTTGGTTTTGGCAACAA 
CGCCACGGCTAACCAGTATTAA 

SEQ ID #3 0 

AgfA::PT3#10 protein sequence 

MKLLKVAAFAAIWSGSALAGWPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSD 

ARKSETT I TQ°GY^N^a™^C^ AnMgTTBT ' Tn ^^ 
LVNQTASD S S VMVRQ VGFGNNATANQ Y 
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